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DESCRIPTIVE STATISTICS 


‘The word statistics comes from the Latin status, meaning “state.” 
This derivation reflects that numerical information was bistorically 
gathered and interpreted by governments 

‘The ability to describe data in various ways bas always been 
valuable. Now, however, these techniques are becoming more im- 
portant because of the tremendous, ever-growing number of data 
being collected in almost all areas of knowledge. The need to make 
sense out of masses of information bas led to the development of 
formalized ways of describing data. To make intelligent decisions, 
as well as to understand the world around us, we must become 
familiar with these primary methods of portraying data 

Given a raw set of data, often we can detect no overall pattern 
Perbaps some values occur more frequently, a few extreme values 
may stand out, and usually the range of values is apparent. The 
presentation of data, including summarizations and descriptions, 
and involving such concepts as representative or average values, 
measures of dispersion, positions of various values, and the shape 
ofa distribution, falls under the broad topic of descriptive statistics, 
the subject of Part 1 of this book. This aspect of statistics is in 
contrast to analysis, the process of drawing inferences from limited 
data, a subject that is discussed in later chapters. 

More specifically, Chapters 1-4 are concerned with (1) bow to 
measure central tendency; (2) bow to measure variability; (3) bow 
to measure position; and (4) bow to display shave. 


CENTRAL TENDENCY 


‘The word average is used in phrases common to everyday conversation. 
People speak of bowling and batting averages or the average life expectancy 
of a battery or a human being. Actually the word average is derived from the 
French avarie, which refers to the money shippers used to contribute to help 
compensate for the losses suffered by other shippers whose cargo did not 
arrive safely (that is, the losses were shared, with everyone contributing an 
‘average amount). In common usage average has come to mean a repre- 
sentative score or 4 typical value or the center of a distribution. Mathematically, 
there are a variety of ways to define the average of a set of data, In practice, 
‘we use Whichever method is most appropriate for the particular case under 
consideration. 

In the following paragraphs we consider the three primary ways of denoting 
an average: 


1, The median, which is the middle number of a set of numbers arranged 
in numerical order. 

2. The mode, which is the number that occurs most frequently. 

3. The mean, which is found by summing items in a set and then dividing 
by the number of items. 


Erample 1-1 Consider the following set of homerun distances (in feet) to center 
field in 13 ballparks: (387, 400, 400, 410, 410, 410, 414, 415, 420, 
420, 421, 457, 461). What is the average? 


Answer: The median is 414 (there are six values below 414 and 
six values above). The mode is 410 (occurs three times). The mean 
is 


+ 400 + 400 + 410 +... + 457 + 461 


“ 4173 feet 


MEDIAN 


The word median is derived from the Latin medius which means “middle.” 
‘The values under consideration are arranged in ascending or descending order. 
If there is an odd number of values, the median is the middle one. If there are 
an even number, the median is found by adding the two middle values and 
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dividing by 2. Thus the median of a set has the same number of elements 
above it as there are below it 

The median is not affected by exactly how lange the langer values are or by 
‘exactly how small the smaller values are. Thus it is a particularly useful measure- 
ment when the extreme values, called outliers, are in some way suspicious, or 
when we want to diminish their effect. For example, if ten mice try to solve a 
maze, and nine succeed in less than 15 minutes, while one is still trying after 24 
hours, the most representative value is the median (not the mean, which is over 
1 hour). Similarly, if the salaries of four executives are each between $240,000 
and $245,000 while a fifth is paid less than $20,000, again the most representative 
value is the median (the mean is under $200,000). 

In certain situations the median offers the most economical and quickest way 
to calculate an average. For example, suppose 10,000 light bulbs of a particular 
brand are installed in a factory. An average life expectancy for the bulbs can most 
easily be found by noting how much time passes before exactly one-half of them 
have had to be replaced. The median is also useful in certain kinds of medical 
research, For example, to compare the relative strengths of different poisons, a 
scientist notes what dosage of each poison will result in the deaths of exactly 
‘one-half the test animals. If one of the animals proves especially susceptible to a 
particular poison, this median lethal dose is not affected. 


MODE 


‘The mode, or most frequent value, is an easily understood representative 
score. It is clear what is meant by “The most common family size is four” or 
“That professor gives out more Bs than any other grade,” If exactly one value 
must be chosen as a basis for certain decisions, this value is often the mode. 
For example, if a supplier can expand production of only a single item, he 
may well decide upon the one most frequently sold. However, since the mode 
presents the typical result, its use tends to rule out further arithmetic calcula- 
tions based on this result. For example, knowing that a certain player is the 
most popular with the fans does not lead to calculations based simply on this 
knowledge. 

When two numbers in a set have equal frequency, and this frequency is 
higher than any other, we say that the set is bimodal. Even when one frequency 
is greater than the other, but both are above the other frequencies, the set will 
sometimes be termed bimodal. Occasionally, the values in a set are distributed 
so evenly that the idea of a modal value has very little meaning for that set. 


MEAN 


While the median and mode are often useful in descriptive statistics, the 
mean or, more accurately, the arithmetic mean, is most important for statistical 
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inference and analysis. Also, for the layperson, the “average” is usually 
understood to be the “mean.” 

The mean of a whole population (the complete set of items of interest) is 
often denoted by the Greek letter ys (mu), while the mean of a sample (a part 
of a population) is often denoted by . For example, the mean value of the set 
Of all houses in the United States might be 4 = $56,400, while the mean value 
of 100 randomly chosen houses might be ¥ = $52,100 or perhaps X = $63,800 
or even X = $124,000, 

In statistics we leam how to estimate a population mean from a sample 
mean. Throughout this book, the word sample implies a random sample, that 
is, a sample selected in such a way that each element of the population has 
an equal chance to be included in the sample. In the real world, this process 
cof random selection is often very difficult to achieve. 

Mathematically, the mean = (Sx)/n, where Sx represents the sum of all the 
elements of the set under consideration, and is the actual number of 
elements. ¥ is the uppercase Greek letter sigma. 

Unlike the mode or median, the mean is sensitive to a change in any value, 
as shown in Example 1.2, 


Erenpe 12 Suppose that the numbers of unnecessary procedures recom: 
mended by five doctors in a I-month period are given by the set 
12, 2, 8, 20, 33). Note that the median is 8, the mode is 2, and the 


mean js 2+ 2+ 8420 + 33 13.1 és discovered that the fifth 


3 
doctor recommended an additional 25 unnecessary procedures, 
how are the median, mode and mean affected? 


Answer: The set is now (2, 2, 8, 20, 581. The median is still 8 
and the mode is still 2. The mean, however, changes to 
2+2+8+20458 4g 


Adding the same constant to each value will increase the mean by a like 
amount. Similarly, multiplying each value by the same constant will multiply 
the mean by a like amount. (See Study Questions 1 and 2.) 


Example 13 Suppose the salaries of six employees are $3000, $7000, $15,000, 
$22,000, $23,000, and $38,000. 
4, What is the mean salary? 


Answer: 


3000 + 7000 + 15,000 + 22,000 + 23000 + 38000 


% = $18,000. 
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‘6, What will the new mean salary be if everyone receives a $3000 


increase? 
Answer: 

S00 + 10000 + 18000 + 25000 + 26000 + 41000 _ 524959, 
Note that $18,000 + $3000 = $21,000. 
¢. What if everyone receives a 10% raise? 
Answer: 

3300 + 700 + 165500 + 24,200 + 25300 + 41800 _ 519.559 


6 

Note that 110% of $18,000 is $19,800. 

Because many real-life applications of statistics involve comparisons of two 

populations, it is important for later use to note how the mean is calculated 
for a set of differences. 


Suppose set X = (2, 9, 11, 22) and set Y = (5, 7, 15). Note that the 


mean of set X is py= 2227 UL * 22 = 1) and the mean of set 
Y is py=2*2*15 <0 Form the set Z of differences by 


3 
subtracting each element of ¥ from each of X: 


Z = (2-5, 2-7, 2-15, 9-5, 9— 
22-5, 22-7, 22-15) 
(+3, -5, -13, 4, 2, -6, 6 4, —4, 17, 15, 7) 


7. 9-15, 5, 7, 11=15, 


‘What is the mean of 2? 


Answer: 


= rB—S- 144 +2- 64644-4417 1547 
ne 12 


Note that fh, = By ~ By 

In general, the mean of a set of differences is equal to the difference of the 
means of the two original sets. Even more generally, if a sum is formed by 
picking one element from each of several sets, the mean of such sums is simply 
the sum of the means of the various sets. 
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Three secondary procedures for denoting an average are the barmonic 
‘mean, the geometric mean, and the trimmed mean. 


HARMONIC MEAN 


‘The harmonic mean of a set of n numbers is defined to be 


= 


ry 


that is, m divided by the sum of the reciprocals of the numbers. 


! 


Suppose four kinds of nails cost $0.01, $0.02, $0.03, and $0.04 
apiece. What is the average cost of a nail if we purchase $3.00 
worth of each kind? 


Answer: The number of $0.01 nails purchased is 3.00/0.01 = 300, 
Similarly, the numbers of $0.02, $0.03, and $0.04 nails purchased 
are 3,00/0.02 = 150, 300/003 = 100, and 3.00/0.04 = 75, 
respectively. The total number of nails purchased is 300 + 150 + 
100 + 75 = 625, while the total amount spent is 4(S3.00) = $12.00. 
Thus the average cost per nail is 12.00/625 = $.0192. 


Note that the answer could have been calculated immediately as 
the harmonic mean of the set (0.01, 0.02, 0.03, 0.04 


——— 
pe ire Cyne erie 
Gor * On02 * 003 * Gos 


0192 


Also note that if we bought 100 of each of the four kinds of nails 
(that is, same number of each kind rather than same money spent 
‘on each), then the average spent per nail would be (01 + .02 + 
03 + 04/4 = $025, 


GEOMETRIC MEAN 


‘The geometric mean of a set (ji, ¥2y-» Yl is defined to be 


that is, the rth root of the product of the elements. In particular, the geometric 
mean of two numbers is the square root of their product. The geometric mean 
is appropriate in situations of what is known as geometric growth. 


Example 16 The population of the United States in 1820 was 9,638,453 and in 


1840 was 17,069,453. Estimate the population in 1830 using the 
‘geometric mean. 


Answer: 


¥(9638,553)07,009,453) = 12826657 


(The actual value was 12,866,020.) 


TRIMMED MEAN 


‘The trimmed mean represents an attempt to reduce the influence of extreme 


values (outliers) on the mean. It is calculated by arranging the terms in 
numerical order, deleting the first quarter and the fourth quarter of the values 
and then finding the arithmetic mean of what remains. (Alternatively, instead 
of 25%, any percent, for example 5% or 10%, can be taken away from both 
the upper and lower ends, depending on the makeup of the particular set 
under consideration.) 


Exe 17 The World Bank has predicted that the 20 most populous coun- 


tries in the year 2100 and their populations (in millions) will be 
as follows: 


India, 1632; China, 1571; Nigeria, 509; Russia, 376; Indonesia, 356; 
Pakistan, 316; United States, 309; Bangladesh, 297; Brazil, 293; 
Mexico, 196; Ethiopia, 173; Vietnam, 168; Iran, 164; Zaire, 139; 
Japan, 128; Philippines, 125; Tanzania, 120; Kenya, 116; Burma, 
112; and Egypt, 111. Calculate the mean and the trimmed mean 
for the average predicted populations of these countries, 


Answer: The mean is (1632 + ... + 111)/20 = 361 million, while 
the trimmed mean, without the extreme influence of India, China, 
and Nigeria is (316 + ... + 128)/10 = 218 million. 


EXERCISES 


1. In the second game of the 1989 World Series between Oakland and San 
Francisco (played 2 days before the northern California earthquake post- 
poned the series), ten players went hitless, eight players had one hit 
apiece, and one player (Rickey Henderson) had three hits. What was the 
mean, median, and mode number of hits? 


2, In 1993 the eight countries in which the most patents were filed were as 


follows: Japan, 208,347; United States, 57,890; Germany, 42,922; France, 
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11,187; Britain, 9333; China, 5566; Italy, 3726; and Australia, 3315 (The 
Economist, August 27, 1994). What are the median, mean, and trimmed 
mean averages for the number of 1993 patents in these eight countries? 


3, On August 23, 1994, the Say total returns for equal investments in foreign 
stocks, U.S. stocks, gold, money market funds, and treasury bonds aver- 
aged 0.086% (Business Week, September 5, 1994). If the respective returns 
for foreign stocks, U.S. stocks, gold, and money market funds were 
+0.70%6, 0.1196, +1.34%, and +0.05%, what was the return for treasury 
bonds? 

4, The weights (in pounds) of the members of the 1968 Chicago Bear 
offensive line were as follows: Bob Pickens, 255; Jim Cadile, 240; Mike 
Pyle, 250; George Seals, 270; Rufus Mayes, 260. What was the average 
(mean) weight of these linesmen? What would the average weight have 
been if each of the players had gained 10 pounds? If each had increased 
his weight by 10 percent? 

5, Fecal pollution can lead to illness among swimmers in small lakes, The 
Environmental Protection Agency recommends that enterococcus counts 
at freshwater swimming areas not exceed a geometric mean of 33 per 
deciliter (EPA report no. 440/5-84-002). If three samples of lake water 
taken at different sections of a swimming area show pathogen counts of 
25, 30, and 40 organisms per deciliter, determine whether the geometric 
mean exceeds the EPA limit. 

6. The results of a recent University of Chicago study reported in The New 
York Times are tabulated below: 


o 1 Esl 510 n-20 ny 
Men En 20% 21% 2% 16% 1™% 
Women ES 326 36% 20% om am 


In which category does the median number of partners fall for men? For 
women? Explain. 

7. In your daily newspaper, find examples of two sets of numbers, in one of 
which the mean is greater than the median, and in the other the mean is 
less than the median. Can you find an example of data where the mean 
and median are equa?” 


Study Questions 
1, Show that the addition of the same constant to each value in a set 
will increase the mean by a like amount. (Hint: suppose that the 
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set has the m elements: X;, 3, Xs) %~ Then the mean is: 


p= LEA * A" show thatthe mean ofthe set Ly + 6,2 + 
CRF ny He t disptcl 

2. Prove that multiplying each value in a set by the same constant will 
multiply the mean by a like amount. 

3. Questions 1 and 2 form the background for a technique called coding 
which in certain cases provides a shortcut for computing the mean. Using 
these addition and mubiplication principles, how would you mentally 
calculate the mean of the set 25.002. 25.015, 25.013, 25.008 25.009, 
25.0017 

4, Suppose X = bey, Xs Ngl and Y= [yy 92, «5 Yale Form the set of differ- 
ences Z = Lx, ~ y).4=1,2,..,mand/= 1, 2, ... m. Show that pis = jay — pty 


[Hint Lz =nEx-—mZy. Whe Then EE = 9 


VARIABILITY 


In describing a set of numbers, not only is it useful to designate an average 
value, but also it is important to be able to indicate the variability or the 
dispersion of the measurements. A producer of time bombs aims for small 
variability—it woul! not be good if his 30 minute fuses actually had a range 
‘of 10 to 50 minutes before detonation. On the other hand, a teacher interested 
in distinguishing the beter from the poorer students aims to design exams with 
large variability in results—it would not be helpful if all her students scored 
‘exactly the same. The players on two basketball teams may have the same 
average height, but this fact doesn't tell the whole story. If the dispersions are 
quite different, one team may have a 7-foot player, whereas the other has no 
one over 6 feet tall, Two Mediterranean holiday cruises may advertise the same 
average age for their passengers. One, however, may have only passengers 
between 20 and 25 years old, while the other has only middle-aged parents in 
their 40s together with their children under age 10. 
‘There are four primary ways of describing variability or dispersion: 


1, The range, which is the difference between the largest and. smallest 
values. 

2. The average deviation, which is found by averaging the absolute 
differences of all the values from the mean, 

3. The variance, which is determined by averaging the squared differ- 
ences of all the values from the mean, 

4, The standard deviation, which is the square root of the variance. 


Tzeng 21 The monthly rainfall in Monrovia, Liberia, where May through 
—— _ October is the so-called rainy season and November through April 
the dry season, is as follows: 


Monti: Jon Feb Mar Apr ‘May Jon Jul og Sop Oct Nov Dec 
BinGa® 1 2 4 6 1 37M 1m MO 4 


The mean is 


1+24+4+6+18+ 37+ 31416 +28 +24 +944 
12 


= 15 inches 
nu 
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‘What are the measures of variability? 


Answer: Range: The maximum is 37 inches (June), and the 
minimum is 1 inch Ganuary). Thus the range is 37 ~ 1 = 36 inches 
of rain. 


Average deviation: 
f=19 + 2-19 +H - 19+) 19+ — 19+... — 15 
12 


= M+IS+ 4943+ 22416414134 9+6411 
m2 


= 107 inches 
Variance: 


M2 + FIP +P +H +2 +S + HIF + HF +E +P 
2 
Standard deviation: V3.7 = 12.0 inches 


= 1437 


RANGE 


‘The simplest, most easily calculated, measure of variability is the range, 
‘The difference between the largest and smallest values can be noted quickly, 
and it does give some impression of the dispersion. However, it is entirely 
dependent on the two extreme values and is insensitive to the ones in the 
middle. 

‘One use of the range is to evaluate samples with very few items. For 
example, some quality control techniques involve taking periodic small sam- 
ples and basing further action upon the range found in several such samples. 


AVERAGE DEVIATION 


Unlike the range, most other measures of variability are concerned with the 
dispersion around some center or representative value. The average deviation 
is the sum of the deviations from the mean (without regard to sign) divided 
by the number of elements: 


Average deviation = Bsa 
where 41 is the mean and 1 is the population size 

‘The average deviation gives information in an easily understood form. If a 
town has a mean yearly snowfall of 90 inches with an average deviation of 10 
inches, the reader has an intuitive sense that a yearly snowfall between 80 and 
100 inches would probably not be unusual. If two companies advertise mean 
starting salaries of $25,000, but in one the average deviation is $500 while in 
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the other it is $8,000, applicants will have some feeling for what this difference 
‘means. The average deviation is also useful with regard to discrepancies in 
populations where all the values are planned rather than related to chance 
occurrence. In such cases it is worthwhile to discuss how far from the mean 
each value is chosen to be, and what the average is of these differences, 


VARIANCE 


In many situations, dispersion is not planned, but rather is the result of 
various chance happenings. In such cases, the average deviation is not the 
proper tool for measuring variability. For example, consider the motion of 
microscopic particles suspended in a liquid. The unpredictable motion of any 
particle is the result of many small movements in various directions caused by 
random bumps from other particles, If we average the total displacements of 
all the particles from their starting points (this corresponds to the average 
deviation), the result does not increase in direct proportion to time. If, 
however, we average the squares of the total displacements of all the particles, 
this result does increase in direct proportion to time. 

‘The same holds true for the movement of paramecia. Their seemingly 
random motions as noted under a microscope can be described by the fact 
that the average of the squares of the displacements from their starting points 
is directly proportional to time. Also, consider ping-pong balls dropped straight 
down from a high tower and subjected to chance buffeting in the air, We can 
measure the deviations from a center spot on the ground to the spots where 
the balls actually strike. As the height of the tower is increased, the average of 
the squared deviations increases proportionately. 

Ina wide variety of cases we are in effect trying to measure dispersion from 
the mean due to a multitude of chance effects. The proper tool in these cases 
is the average of the squared deviations from the mean. This is called the 
variance and is denoted by a2 (a is the lower case Greek letter sigma) 


e@ = Lan 


m 


For circumstances specified in Chapter 11, the variance of a sample, denoted 
by 8, is calculated as 


Zor - 3? 
o= tea 


i 


22 During the years 1929 through 1939 of the Great Depression, the 
weekly average hours worked in manufacturing jobs were 45, 43, 
41, 39, 39, 35, 37, 40, 39, 36, and 37. What is the variance? 
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Answer: 


p = SAAB + AI + 39 + 39 + 35 + 37 + 40+ 394 36437 
1 


= 392 bours 


og? = AS23920 + 43 ~ 392)+._.+G6 — 392) + 37 - 3927 
Tt 


Important for later use is a procedure to calculate the variance for a set of 
differences, 


al 


Example 2.3 Consider the following from Chapter 1 


X = (2,9, 11, 221, ¥ = (5, 7, 151, and Z = the set of differences 
6, 4, =4, 17, 15, 71. What is the variance 


_ =? + 9-1 + 1-1 + 2-1 
a 


= Basis 

ry 
6 = neha CoD tO= 9 = 9) Ss. e 
” 3 3 
w= 

(23 = 2 + 5 ~ 28 + 13 = 24H = 2? 
cles 12 

= 22 2 017 


2 

How are o3,a}and 2 related? Note that in the above example, 
2 = oi +a}! This is true for the variance of any set of differences, More 
generally, ifa total is formed by a procedure that adds or subtracts one element 
from each of several sets, the variance of the resulting totals is simply the sum 
of the variances of the several sets. 

Not only can we sum variances as shown above to calculate total variance, 
but we can also reverse the process and determine how the total variance is 
split up among its various sources. For example, we can find what portion of 
the variance in numbers of sales by a company’s sales representatives is due 
to the individual salesperson, what portion is due to the territory, what portion 
is due to the particular products sold by each salesperson, and so on. (The 
average deviation cannot be partitioned in this way.) 

‘An arithmetic tool for calculating the variance is: 
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In words, the variance can be found by subtracting the square of the mean 
from the average of the squared scores. 


Troupe PA Let X = (3, 7, 15, 231. What is the variance? 


Answer: 
Lx a3+7+15+23= 48 
Lat =F + 7 +158 + 239" = B12 
22.8. 
” 772 


‘The variance can be calculated from its definition: 


= Rasp 
° n 


(3 — 127 + ¢ 


ye + 5 = 12)? + ( 


(Or it can be calculated as follows: 


2 = 812 
. 4 


~ 12 = 203-144 = 59 


Similarly, there is an arithmetical tool for calculating the variance of a 
sample: 


yep - oe 
= 


STANDARD DEVIATION 


Suppose we wish to pick a representative value for the variability of some 
population. The preceding discussions indicate that a natural choice is the 
value whose square is the average of the squared deviations from the mean. 
‘Thus we are led to consider the square root of the variance. This is called the 
standard deviation, is denoted by 0, and is calculated as follows: 
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Similarly, the standard deviation of a sample (see Chapter 11) is denoted by 
sand is calculated as follows: 
2 oe 


o- Poa - 7 
m1 nT 


Tempe 25 The approximate numbers of bank suspensions during the years 
1928 through 1933 were 500, 650, 1350, 2300, 1450, and 4000, 
respectively. What is the standard deviation? 


Answer: 
p= 500+ 650+ 1350+ 2300 + 1450+ 4000 _ srgg 
2 + syst 
ge Lot use = sos + ss + 250" + 23007 1394375 


o = ¥1395375 ~ 1181 


‘Two secondary procedures for measuring variability or dispersion are the 
interquartile range and the relative variability 


INTERQUARTILE RANGE 


‘This is one method of removing the influence of extreme values on the 
range. The interquartile range is calculated by arranging the data in numerical 
‘order, removing the upper and lower one-quarter of the values, and then 
noting the range of the remaining values. 


Example 26 


Suppose that farm sizes in a small community have the following 
characteristics: the smallest value is 16.6 acres, 10% of the values 
are below 23.5 acres, 25% are below 41.1 acres, the median is 57.6 
acres, 60% are below 87.2 acres, 75% are below 101.9 acres, 90% 
are below 124.0 acres, and the top value is 201.7 acres. 

4. What is the range? 


Answer: The range is 201.7 ~ 166 = 185.1 acres. 
6. What is the interquartile range? 


Answer: The interquartile range, with the highest and lowest 
one-quarter of the values removed, is 101.9 ~ 41.1 = 60.8 acres. 
‘Thus, while the largest farm is 185 acres more than the smallest, 
the middle 50% of the farm sizes range over a 61 acre interval. 
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RELATIVE VARIABILITY 


‘A comparison of two variances is more meaningful if the means of the 
populations are also taken into consideration. Relative variability is defined to 
be the quotient obtained by dividing the standard deviation by the mean. 
Usually the result is then expressed as a percentage. 


Trample 27 


Suppose that the mean salary for police officers is $45,000 with a 
standard deviation of $9000, while the mean salary for fire fighters 
is $35,000 with a standard deviation of $7700. What are the 
respective relative variabilities? 


Answer: The relative variabilities are 9000/45,000 and 
7700/35,000, or 20% and 22%6, respectively. Thus, while the police 
officers’ salaries vary more in absolute terms, the fire fighters’ 
salaries vary more in relative terms when the difference in means 
is ken into consideration. 


EXERCISES 


1, The number of criminals executed in the United States during the 1930s 
and 1940s were as follows: 1930—155; 1931—153; 1932—140, 1933-160; 
1934—168; 1935-199, 1936-195; 1937—147; 1938-190; 1939-160, 
1940—124; 1941123; 1942147; 1943131; 1944—120, 1945—117; 
1946—131; 1947—153; 1948—119; 1949—119. What are the mean, the 
range, and the standard deviation? 


2. According to a 1988 New York Times anicle, the ten car models with the 
highest theft rates were as follows: 


Vehide Made Tatts per 1000 Cars 
Pontiac Fiebied 30.4 
Chevrolet Camaro 26.02 
Chevrolet Monte Carlo 20.28 
Toyota MR2 1925 
Buick Regal 1470 
Migsubishi Starion 1470 
Ferrari Mondial 13.60 
Misubishi Mirage 1280 
Pontiac Fiero 12.68 
Oldsmobile Cutlass 1173 


a, What are the mean, range, and standard deviation of these theft rates? 
b. How will each of these values change if each theft rate increases by 
152 by 1592 


3. Because Brazilian doctors do not use generic prescriptions, black markets 
exist in Brazil for many drugs (Tbe Lancet, October 15, 1994). For example, 
the black market price per tablet of acetaminophen for seven brands are 
as follows: Acetofen, $0.23; Dorico, $0.10; Pacemol, $0.17; Parador, $0.14; 
Tylenol, $0.18; Paracetamol, $0.08; and Nevralgina, $0.27. What are the 
‘mean, range, and standard deviation of these prices? 


Study Questions 
1. Suppose that the same constant is added to every value in a population. 
How does this affect the range? The average deviation? The variance? The 
standard deviation? 

2. Suppose that each value in a set is multiplied by the same constant, How 
does this affect the range? The average deviation? The variance? The 
standard deviation? 

3, Does it make sense to talk about the average of the deviations from the 
mean (with regard to sign? In other words, what can be said about 
Xr-w), 
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4, If the median was the most significant measure of central tendency, the 
appropriate measure of variability might well be Elena Show that 
Zla— Ki is smallest when K = M (the median). {Hin Think about the 
position of Mamong the as. As we move away from M by a small amount 
A, what happens to each Ix Al in changing to lx — KI where K= M 
+ ie} 

5. If youve had an introductory caus couse, use cakuls to show that 
Xx ~ 9#is smallest when =p. Hint (x ~ 92 = ? Set equal to 0 and 


solve for t] 


POSITION 


We have seen several ways of choosing a value to represent the center of 
a distribution. We also need to be able to talk about the pasition of any other 
value. In some situations, such as wine tasting, simple rankings are of interest 
Other cases, for example, evaluating college applications, may involve posi- 
tioning according to percentile rankings. There are also situations in which 
position can be specified by making use of measurements of both central 
tendency and variability 

‘There are three important, recognized procedures for designating position: 


1. Simple ranking, which involves arranging the elements in some order 
and noting where in that order a particular value falls, 

2. Percentile ranking, which indicates what percent of all values fall below 
the value under consideration 

3. The z-score, which tells very specifically by how many standard devia- 
tions a particular value varies from the mean. 


Ly 


‘The water capacities (in gallons) of the 57 major solid-fuel boilers 
sold in the United States are as follows: 6.3, 7.4, 86, 10, 12.1, 50, 
8.2, 98, 11.4, 12.9, 14.5, 16.1, 26, 21, 27, 40, 55, 30, 35, 55, 65, 
18.8, 23.3, 28.3, 33.8, 26.4, 33, 50, 35, 21, 21, 18.5, 26.4, 37, 12, 12, 
50, 65, 65, 56, 66, 60, 70, 27.7, 34.3, 42, 46.2, 33, 25, 29, 40, 19, 9, 
125, 15.8, 245, and 16.5 John W. Bartok, Solid-Fuel Furnaces & 
Boilers, Garden Way). What is the position of the Passat HO-45 
which has a capacity of 46.2 gallon? 

Answer: Since there are 12 boilers with higher capacities on the 
list, the Passat has a simple ranking of 13th (out of 57). 
Fony-four boilers have lower capacities, so the Passat has a 
Percentile ranking of 44/57 = 77.2%. 


‘The above list has a mean of 30.2 and a standard deviation of 17.8, 
so the Passat has a z-score of (46.2 — 30.3)/17.8 = 0.89. 


SIMPLE RANKING 


Simple ranking is easily calculated and easily understood. We know what it 
means for someone to graduate second in a class of 435, or for a player from 
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a team of size 30 to have the seventh best batting average. Simple ranking is 
useful even when no numerical values are associated with the elements. For 
example, detergents may be ranked according to relative cleansing ability 
without any numerical measurements of strength. 


PERCENTILE RANKING 


Percentile ranking, another readily understood measurement of positio 
helpful for comparing positions with different bases. We can more ea 
‘compare a rank of 176 out of 704 with a rank of 187 out of 935 by noting that 
the first has a rink of 75%, the second a rank of 80%. Percentile rank is also 
useful when the exact population size is not known or is irrelevant. For 
example, it is more meaningful to say that Jennifer scored in the 90th percentile 
(on a national exam, rather than trying to determine her exact ranking among, 
some large number of test takers. 


Z-SCORE 


‘The z-score is a measure of position that takes into account both the center 
and the dispersion of the distribution. More specifically, the z-score of a value 
tells how many standard deviations the value is from the mean, Mathematically, 
X — gives the raw distance from 4 to 2: dividing by o converts this to 
‘numbers of standard deviations. Thus z = (x ~ 4x, where 2° is the raw score, 
Is the mean, and aris the standard deviation. If the score x is greater than 
the mean 2, then z is positive; if x is less, then z is negative. 

Given a score, we can reverse the procedure and find the corresponding 
raw score. Solving for x gives x = x + 20. 


Example 32 Suppose that the average (mean) price of gasoline in a large city 
is $1.80 per gallon with a standard deviation of $0.05. Then $1.90 
has a z-score of (1.90 — 1.80)/0.05 = +2, while $1.65 has a 
z-score of (1.65 — 1,800.05 = —3. Alternatively, a z-score of 
+2.2 corresponds to a raw score of 1.80 + 2.2(0.05) = 1.80 + 
0.11 = 1.91, while a z-score of 1.6 corresponds to 1.80 — 
1.60.05) = 1.72. 


Its often useful to portray integer 2-scores and the corresponding 
raw scores as follows: 


Paco 


fem 33 
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Suppose the attendance at a movie theater averages 780 with a 
standard deviation of 40. Adding and subtracting multiples of 40 
to the mean 780 gives: 


740 Theater 
attendance 
1 1 1 1 1 
—2 o 1 2 3 score 


An attendance of 835 would convert to a z-score as follows: 


$35 780 «55.2 1.375 a zscore of ~215 would conver 10 a 


theater attendance as follows: 780 ~ 2.15(40) = 694. 


700740 Theater 
attendance 
1 1 
=3 3 score 


‘An assembly line produces an average of 12,600 units per month 
‘with a standard deviation of 830 units. Adding and subtracting 
multiples of 830 to the mean, 12,600, gives: 


1 1 1 1 1 4 4 


TOLI0 1094011770 1260013430 1426015090 Units/mo 
1 1 1 1 1 1 1 
a a a 2 3 Escore 


‘Thus 11,106 has a z-score of (11,106 — 12,600)/830 = —1.8, while 
a z-score of 2.4 corresponds to 12,600 + 2.4(830) = 14,592. 


08, M4592 
at if 1 1 1 1 [ L 
TOI10 10940 11770 12600 1343014260 15090 Units/mo 
a 1 1 L 1 1 1 
3-3] + 8 2 2 Fscore 


Cy 
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EMPIRICAL RULE 


‘The empirical rule applies specifically to symmetric, “bell-shaped” data. In 
this case, about 68% of the values lie within one standard deviation of the 
mean; about 95% of the values, within two standard deviations of the mean; 
and more than 99% of the values, within three standard deviations of the mean. 

Shape will be discussed in detail in Chapter 4, but consider intuitively the 
following displays, where the horizontal axis shows scores: 


35 Suppose that taxicabs in New York City are driven an average of 
75,000 miles per year with a standard deviation of 12,000 miles. 
‘What information does the empirical rule give us? 


Answer: Assuming that the distribution is “bell-shaped,” we can 
conclude that approximately 68% of the taxis are driven between 
163,000 and 87,000 miles per year, approximately 95% are driven 
between 51,000 and 99,000 miles, and virtually all are driven 
between 39,000 and 111,000 miles 


The empirical rule also gives a useful quick estimate of the standard 
deviation in terms of the range. We can see in the display above that 95% of 
the data fall within a span of four standard deviations (from —2 to +2 on the 
z-score line), and 99% of the data fall within six standard deviations (from ~3 
to +3 on the score line). It is therefore reasonable to conclude that for these 
data the standard deviation is roughly between one-fourth and one-sixth of the 
range. Since we can find the range of a set almost immediately, the empirical 
rule technique for estimating the standard deviation is often helpful in pointing 
‘out probable arithmetic errors. 


Tromple 36 Ifthe range of a data set is 60, what is an estimate for the standard 
deviation? 


Answer: By the empirical rule, the standard deviation would be 
expected to be between (1/6)60 = 10 and (1/4)60 = 15. If the 
standard deviation is calculated to be 0.32 or 87 there is a probable 
arithmetic error; a calculation of 12, however, is reasonable. 
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CHEBYSHEV’S THEOREM 


When data are spread out, the standard deviation is larger; when data are 
tightly compacted, the standard deviation is smaller. However, no matter what 
the dispersion, and even if the data are not bell-shaped, certain percentages of 
the data will always fall within specified numbers of standard deviations from 
the mean. 

‘The Russian mathematician Chebyshev showed that, for any set of data, at 
least 1 — 1/¥# of the values lie within & standard deviations of the mean, This 
result is known as Chebyshev's Theorem. In terms of percentages, at least 
100(1 — 1/426 of the values are within & standard deviations of the mean, In 
terms of scores, at least 1 — 1/4? of the values have scores between —k 
and +k Therefore, when & = 3, at least 1 ~ 1/9 or 88.89% of the values lie 
within three standard deviations of the mean, And, when & = 5, at least 
1 ~ 1/25 of 96% of the values have =scores between —5 and +5. 


«| 


Suppose an electronic par takes an average of 3.4 hours to move 
through an assembly line with a standard deviation of 0.5 hour. 
What does Chebyshev’s Theorem give using # = 2and k= 4? 


Answer: Using k = 2, we find that at least 1 — 1/4 or 75% of the 
pans take between 3.4 — 2(0.5) = 2.4 hours and 3.4 + 2(0,5) = 
44 hours to move through the line. Similarly, with k= 4, at least 
15/16 oF 93.75% of the parts take between 1.4 and 5.4 hours. 


Given a range around the mean, we can convert to 2scores and ask about 
the percentage of values in the range. If one set has mean jt = 85 and standard 
deviation & = 1, while a second set has 4. = 85 and @ = 5, then, for example, 
the percentages of values between 75 and 95 will be different. For the first set, 
the relevant zscores are +10; for the second the relevant zscores are +2. 
‘Thus at least 99/100 or 99% of the first set’s values, but possibly only 3/4 or 
75 of the second set’s values, lie between 75 and 95, 


Tremp 38 Suppose the daily intake at a toll booth averages $3500 with a 
standard deviation of $200. What percentage of the daily intakes 
should be between $3000 and $4000? 


Answer: The relevant zscores are +500/200 = 2.5, so Cheby- 
shev’s theorem says that at least 1 — 12.5)? = 21/25 or 84% of 
the daily intakes should be between $3000 and $4000. 


Note that, for k= 1, 1 — 1/2 = 0, here the theorem gives no useful information, 

Reminder: The power of Chebyshev's theorem is that it applies to ail sets 
of data. However, if the data are bell-shaped, we can draw stronger conclu- 
sions by using the empirical rule stated earlier. 
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EXERCISES 


1, According to the National Center for Health Statistics, of the 50 states, 
Hawaii has the 7th highest birth rate, the 4th highest abortion rate, and the 
48th highest death rate (only Alaska and Utah are lower). Convert these 
three simple rankings for Hawaii’s birth, abortion, and death rates into 
percentile rankings. 


2. The 70 highest dams in the world have an average height of 206 meters 
with a standard deviation of 35 meters. 

a. The Hoover, Glen Canyon, and Grand Coulee dams have heights of 
221, 216, and 168 meters, respectively. Convert each of these heights 
into a >score, 

b. Three Russian dams, the Nurek, Toktogul, and Charvak have heights 
with zscores of +2.69, +0.28, and ~1.09, respectively. Convert each 
of these 2scores into a riw score, 


3. According to Consumer Reports magazine, the costs per pound of protein 
for 20 major-brand beef hot dogs are as follows: $14.23, $21.70, $14.49, 
$20.49, $14.47, $15.45, $25.25, $24.02, $18.86, $18.86, $30.65, $25.62, 
$8.12, $12.74, $14.21, $13.39, $22.31, $19.95, $22.90, and $19.78. If the least 
expensive is considered the top ranked, what is the position of Thorn 
Apple Valley Brand beef hot dogs at $14.23 per pound of protein? Give 
the answer as a simple ranking, as a percentile ranking, and as a =score, 


4. The average cost per ounce for glass cleaners is 7.7¢ with standard 
deviation of 2.56. 
a, Make a diagram with one horizontal line showing the 2scores ~3, ~2, 
=1,0, 1, 2, 3, and a second line showing the corresponding raw scores, 
b. What is the score of Windex with a cost of 10.1¢? 


5. The average number of degree-days during the month of January in 
Alaskan cities is 1751 with a standard deviation of 552 
a, Make a diagram showing the zscores ~3, ~2, 1, 0, 1, 2, 3, and the 

corresponding raw scores. 
'b. What is the >score of Fairbanks with 2319 degree-days? 

6. Ina certain southwestem city the air pollution index averages 62.5 during 
the year with a standard deviation of 18.0. Assume that the empirical rule 
is appropriate. 

a. The index will fall within what interval 959% of the time? 
, For what percentage of the time will the index fall below 26.5? 


7. During a recent year the average daily price of gold was $475 with a 
standard deviation of $55. 
a. At least 75% of the daily prices were within what interval? 
b. At least what percentage of the daily prices were between $310 and 
$6407 


Study Questions 
1, What does Chebyshev’s theorem say about the percentage of values that 
are three or more standard deviations from the mean? About the percent- 
age of values that are more than four standard deviations from the mean? 
2. Consider the sample from Exercise 3. 
a. What percentage of this sample fall within one standard deviation of 
the mean? Within two standard deviations? Within three? 
b. Compare these answers to the results predicted by Chebyshev's theo- 
rem and predicted by the empirical nule. Do the results suggest bell- 
shaped data? 


SHAPE 


There are a variety of ways to organize and arrange data. Much information 
can be put into tables, but these arrays of bare figures tend to be spiritless and 
sometimes even forbidding. Some form of graphical display is often best for 
seeing patterns and shapes and for giving an immediate impression of every- 
thing about the data. 

In this chapter we develop an understanding of the imponant visual 
representation of data called a bistogram, and then briefly introduce two other 
displays: stem and leaf and box and whisker. 


HISTOGRAMS 


Trample 41 Suppose there are 2000 families in a small town and the distribu- 

——— tion of children among them is as follows: 300 families are 
childless, 400 have one child, 700 have two children, 300 have 
three, 100 have four, 100 have five, and 100 have six. These data 
can be displayed in the following bar grapb: 


aae Lol 
o 4 2 8 4 5 @6 


‘The frequencies of different results are indicated by the beights of 
the bars representing these results. 


A bistogram can be constructed from the above bar graph by 
widening each bar until the sides meet at a point halfway be- 
tween. 
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° 1 2 8 4 5 6 


Sometimes, instead of labeling the vertical axis with frequencies, it is more 
convenient or more meaningful to use relative frequencies, that is, frequencies 
divided by the total number in the population. 


° 300 300/200 = 150 
1 400 400/200 = 200 
2 700 70/2000 = 350 
3 300 30/2000 = .150 
4 100 100/2000 = .050 
5 100 10/2000 = .050 
6 100 10/2000 = .050 


3 


Note that the shape of the histogram is the same whether the vertical axis 
is labeled with frequencies or with relative frequencies, Sometimes we show 
both frequencies and relative frequencies on the same graph, 
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"42 Consider the following histogram displaying 40 salaries paid to the 
top level executives of a large company. 


6 


& 


2 9 4 50 6 7 8 9% 
Salary (in $1000's) 


‘What can we learn from this histogram? 


For example, none of the executives earned more than $90,000 or 
less than $20,000. Twelve earned between $50,000 and $60,000. 
‘Twenty-five percent eamed between $40,000 and $50,000. Note 
how this histogram shows the number of items (here, salaries) 
falling between certain values, whereas the preceding histogram 
showed the number of items (families) falling at each value. 

Trample 43 Consider the following histogram where the vertical axis has not 

—~— been labeled. What can we learn from this histogram? 
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Answer: It is impossible to determine the actual frequencies; 
however, we can determine the relative frequencies by noting the 
fraction of the total area that is over any interval. 


2 FD 
‘We can divide the area into ten equa! portions, and then note that 


1/10 or 10% of the area is above 25-26, 20% is above 26-27, 40% 
is above 27-28, and 30% is above 28-29. 


Although it is usually not possible to divide histograms so nicely 
into ten equal areas, the principle of relative frequencies corre- 
sponding to relative areas will still apply. 


‘The following histogram indicates the relative frequencies of ages 
of U.S. scientists in 1967, What can we learn from this histogram? 


% 0 %& 0 © 8 8 © 6 7 75 


Answer: If we compare areas (or count small rectangles!), we can 
conclude that 50% of the scientists are between 25 and 40 years 
of age, 20% are between 45 and 55 years, and so on. Now if we 
also knew that there were 300,000 U'S. scientists in 1967, we could 
convert these percentages to frequencies, such as 150,000 scien- 
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tists between 25 and 40 years of age, or 60,000 scientists between 
45 and 55 years of age. 


HISTOGRAMS AND MEASURES 
OF CENTRAL TENDENCY 


Suppose we have a detailed histogram such as this one: 


ae 


Our measures of central tendency fit naturally into such a diagram. 


1 


‘The mode is defined as the most frequent value, so it is the point or interval 
at which the graph is highest. 


Mode 


0% 50% 


The median divides a distribution in half, so it is represented by a line that 
divides the area of the histogram in half. 
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The mean is affected by the spacing of all the values. Therefore, if the 
histogram is considered to be a solid region, the mean corresponds to a line 
passing through the center of gravity. 

‘The above distribution, spread thinly far to the low side, is said to be skewed 
to the left. Note that in this case the mean is less than the median. Similarly, a 
distribution spread far to the high side is skewed 0 the right, and its mean is 
greater than its median, 


Example 45 Suppose that the faculty salaries at a college have a median of 
$32,500 and a mean of $38,700. What does this indicate about the 
shape of the distribution of the salaries? 


Answer: The median is less than the mean, so the salaries are 
probably skewed to the right. There are a few highly paid 
professors with the bulk of the faculty at the lower end of the pay 
scale 


HISTOGRAMS, Z-SCORES, AND PERCENTILE 
RANKINGS 


We have seen that relative frequencies are represented by relative areas and 
so labeling the vertical axis is not crucial. If we know the standard deviation, 
the horizontal axis can be labeled in terms of 2-scores. In fact, if we are given 
the percentile rankings of various z-scores, we can construct a histogram. 


Example 4.6 Suppose we are given these data: 


z-score: 2-1 0 
Percentile ranking: ° 2% © 60 70 100 


and asked to construct a histogram. 


We note that the entire area is less than z-score +2 and greater 
than z-score —2. Also, 20% of the area is between z-scores ~2 
and ~1, 40% is between —1 and 0, 10% is between 0 and 1, and 
30% is between 1 and 2. Thus the histogram is as follows: 


{ 
4 
e+ 
4 


24a 0 1 2 24 


Now suppose that we are given four in-between zscores as well: 


roere Percentile renking 
20 
15 
10 
os 
00 
05 
10 
-15 


ou SSazes 


‘Then we have this histogram: 


240 14 2 


With a 1000 scores perhaps the histogram would look like this: 
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The height at any point is meaningless; what is important is 
relative areas. For example, in the final diagram above, what 
percent of the area is between 2scores of +1 and +2” 
Answer: Still 30%. 

What percent is to the left of 0 

Answer: Still 60%. 


Itis often useful to plot two horizontal scales, one with raw scores and one 
with the corresponding >scores. 


Example 47 The heights (in feet) of the 71 mountains over 10,000 feet tall are 


as follows: 29,028; 28,250; 28,208; 26,810; 26,660; 26,504; 25,645; 
25,447; 25,355; 24,902; 24,757; 23,997; 
23,891; 22,831; 22,310; 22,211; 21,489; 
21,276; 20,561; 19,850; 19,347; 18,855; 
18,481; 18,376; 17,999; 17,887; 16,946; 
16,864; 16,795; 16,503; 15,771; 15,157; 
14,688; 14,494; 14,410; 14,272; 14,022; 
13,845; 13,796; 13,665; 13,642; 12,972; 


12,461; 12,389; 12,349; 12,172; 12,060; 11,411; 11,053; and 10,457. 


Straightforward arithmetic yields y= 2¥ = 1515563 ~ 959 


and ng = \V=— ~ w? = ¥25,084,099 = 5008. Counting ele- 


ments (1/71 = 1.41% per element) yields percentile rankings. 


Number Percentile 


core Raw-score less ranking 
25 31019 7 100 
20 28545 70 986 
1s 26041 65 15 
10 23537 3S 75 
05 21033, 662 
0.0 18529 40 56.3 
~05 16025 2 394 
10 413521 n 155 
“15 017 1 14 


-20 8513, 0 0 


Exonple 48 Consider the following murder rates (per 100,000 people) for each 
of the 50 states: AL, 13.3; AK, 12.9; AZ, 9.4; AR, 9.1; CA, 11.7; CO, 
7.3; CT, 4.2; DE, 6.7; FL, 11.0; GA, 14.4; HI, 6.7; ID, 5.4; TL, 9.9; 
IN, 6.2; 1A, 26, KY, 9.0; LA, 15.8; ME, 2.7; MD, 8.2; MA, 
3.7; MI, 10.6; MN, 2.0; MS, 12.6, MO, 10.4; MT, 4.8; NE, 3.0; NV, 
15.5; NH, 1.4; NM, 10.2; NY, 10.3; NC, 10.8; ND, 1.2; OH, 6.9; OK, 
85; OR, 5.0; PA, 6.2; RI, 4.0; SC, 11.5; SD, 1.9; TN, 9.4; TX, 14.2; 
UT, 3.7; VT, 3.3: VA, 88; WA, 4.6; WY, 68; WI, 2.5; WY, 7.1. 
Using our basic formulas gives p = 7.57 and @ = 391, We 
calculate raw scores corresponding to various scores by using x 
= 7.57 + 3912 Finally, we count elements (2% per element) to 
obtain percentile rankings. 


pwor: -2 -15 -1 -05 0 1 15 2 25 3 
rawescore: -0.25 1.71 366 562 757 953 1148 1343 15.39 1734 
ranking 9 4 18 3% S4 68 82 92 9% 100 


c 


T —- I—1—r 
025 171 366 S62 757 953 1148 1349 15.39 17:34 Murder rates 


tT T a a a 
20 4S 1 05 0 05 1 15 2 25 20000 
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STEM AND LEAF DISPLAYS 


Although a histogram may show how many scores fall into each grouping 
or interval, the exact values of individual scores are often lost. An alternative 
pictorial display, called a stem and leaf display, retains this individual in- 
formation. 


Example 4.9 Consider the set (25, 33, 28, 31, 45, 52, 37, 31, 46, 33, 20). Let 2, 
3, 4, and 5 be place holders for 20, 30, 40, and 50. List the last 
Aligit of each value from the original set after the appropriate place 
holder. 


‘The result is the stem and leaf display of these data: 


Stems Leaves 
2 580 
3 31713 
4 56 

5 2 


Drawing a continuous line around the leaves results in a horizon- 
tal histogram: 


Note that the stem and leaf display gives the shape of the histogram and, 
unlike the histogram, indicates the values of the original data 

A revised stem and leaf display is obtained by rearranging the numbers in 
each row in ascending order. This ordered display shows a second level of 
information from the original stem and leaf picture. 

The revised display of the data in Example 4.9 is as follows: 
8 
337 


Trample 410 Suppose the distribution of 25 advertised house prices (in thou- 
sands of dollars) in a certain community is given by the set: (56, 
89, 165, 73, 83, 145, 90, 189, 127, 77, 110, 112, 132, 120, 94, 130, 


84, 65, 99, 154, 86, 120, 122, 103, 1301, 
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One possible stem and leaf display of these data is as follow: 


30-74 | 56 65 73 

75-99 | 77 83 BE 86 89 90 94 99 
100-124 | 03 10 12 20 20 

125-149 | 22 27 30 30 32 45 
150-174 | 54 65 

175-200 | 99 


Note that the stems were chosen to be intervals of length 25, and 
that the “1” of 100 is left out of the leaves so that they align 
vertically, 


BOX AND WHISKER DISPLAYS 


‘A bax and whisker display is a visual representation of dispersion that shows 
the largest value, the smallest value, the median, the median of the top half of 
the set, and the median of the bottom half of the set. 


Example ATT The total farm proctuct indexes for the years from 1919 through 
1945 (with 1910-14 as 100) are as follows: 215, 210, 130, 140, 150, 
150, 160, 150, 140, 150, 150, 125, 85, 70, 75, 90, 115, 120, 125, 100, 
95, 100, 130, 160, 200, 200, 210. (Note the instability of prices 
received by farmers!) The largest value is 215, the smallest is 70, 
the median is 140, the median of the top half is 160, and the 
median of the bottom half is 100. A bax and whisker display of 
these five numbers is as follows: 
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Note that the display consists of two “boxes” together with two “whiskers’— 
hence the name. The boxes show the spread of the two middle quarters; the 
whiskers, the spread of the two outer quarters. This relatively simple display 
conveys information not immediately available from histograms or stem and 
leaf displays. 


EXERCISES 


In Exercises 1-4, draw an appropriate histogram for each set of data 
1. The 54 major brands of cottage cheese on the market have the following 
numbers of calories per 4-ounce serving: 


Calories 75-84 85-94 95-104 105-114 115-124 
Number of brands 15 10 9 15 5 


2. The 60 longest rivers in the world have lengths distributed as follows: 


Length (miles) 102-1099 1500-19 2000-2499 2500-2999 3000-699 3500-399 om-HOD 
Number of rivers 21 a ‘ s 2 2 1 


(The Nile is the longest with a length of 4145 miles, and the Amazon is 
second at 3900 miles.) 


3. Use the farm product index data in Example 4.11 for this exercise. Plot 
two horizontal scales, one with raw scores and one with corresponding 
scores. Use intervals of 1.0 along the score scale for the histogram. 


4. ‘The ages of US. presidents at inauguration were as follows: 57, 61, 57, 57, 
58, 57, 61, 54, 68, 51, 49, 64, 50, 48, 65, 52, 56, 46, 54, 49, 50, 47, 55, 55, 
54, 42, 51, 56, 55, 51, 54, 51, 60, 62, 43, 55, 56, 61, 52, 68, 65, and 46, Plot 
two horizontal scales, one with raw scores and one with corresponding 
scores. Use intervals of 0.5 along the z-score scale for the histogram, 


5. a. Prepare a stem and leaf display for the set (126, 195, 149, 122, 189, 164, 
228, 177, 165, 150, 169, 127, 176, 147, 148, 159, 128, 122, 150, 193, 207, 
164, 168, 110, 155, 127, 152, 174, 190, 219, 125, 193, 141, 127, 155, 133, 
150, 162, 168, 128, 125, 137, 146, 120, 154, 176, 166, 117, 154, 137), 
which represents the 1988 per capita personal income (in hundreds of 
dollars) for the $0 states. 
b, Draw a box and whisker display of the above data. 


6, According to the 1992 NAEP Trial State Assessment the average mathemat- 
ics proficiency scores in eighth grade for 41 states were as follows: AL, 
251; AZ, 265; AR, 255; CA, 260; CO, 272; CT, 273; DE, 262; FL, 259; GA, 
259; HI, 257; ID, 274; IN, 269; 1A, 283; KY, 261; LA, 249; ME, 278; MD, 264; 
MA, 272; MI, 267; MN, 282; MS, 246; MO, 270; NE, 277; NH, 278; NJ, 271; 
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NM, 259; NY, 266; NC, 258; ND, 283; OH, 267; OK, 267; PA, 271; RI, 265; 

SC, 260; TN, 258; TX, 264; UT, 274; VA, 267; WV, 258; WI, 277; WY, 274. 

Draw a box and whisker display. 

Prepare a stem and leaf display. 

Find the mean and standard deviation of the 41 scores. 

According to Chebyshev’s theorem, how many of the scores should be 

within two standard deviations of the mean? How many of the scores 

are actually within this range? 

€. Do you think the empirical rule applies? According to the empirical rule, 
how many of the scores should be within one standard deviation of the 
mean? Within two? Within three? How many of the scores are actually 
within one, two, and three standard deviations of the mean? 


aoge 


7. A 1995 poll by the Program for Intemational Policy asked respondents 
What percentage of the U.S. budget they thought went to foreign aid. The 
mean response was 18%, and the median was 15%. (The actual amount is 
less than 19%.) What do these responses indicate about the shape of the 
distribution of all the responses? 


Study Questions 
1. Consider the following two histograms: 


130 140 150 160 170 


130 140 150 160 170 


‘True or false? 
a. Both sets have about the same mean. 

b. The variance of set 4 is greater than the variance of set B. 

¢. Chebyshev’s theorem applies to both sets. 

4. The empirical rule applies only to set A. 

€. You can be sure that the standard deviation of set A is greater than 5, 
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2. Consider the following histogram: 


$20,000 
Toul 
sales 
$79,000 
$78,000 
Teo W095 To08 
a, What is misleading about the vertical axis and the resulting sales 
picture? 


. Draw a histogram showing the same information, this time, but with a 
vertical axis stating at zer0, 


3. Consider the following histogram: 


1977-81 "1982-84 "1985-86 


Discuss what is misleading about this histogram. 


4. In your daily newspaper find a histogram that misrepresents the informa- 
tion given. 


Part 4 


PROBABILITY 


In the world around us, unlikely events sometimes take place. At 
other times, events that seem inevitable do not occur. Because of the 
‘myriad and minute origins of various happenings, it is often im- 
practicable, or stmply impossible, to predict exact outcomes. How- 
ever, while we may not be able to foretell a specific result, we can 
sometimes assign what is called a probability to indicate the likelt- 
hood that a particular event will occur. 

For our study of statistics, we need an understanding of the 
probability that a given elementary event will happen many times, 
each time under the same circumstances. We want to be able to 
deduce the chance or prospect of occurrence of such events, and 
then use the result to draw inferences about more complex circum- 
stances for which complete information is unavailable. For exam- 
ple, we might analyze tbe past movements of various stock prices 
given various economic conditions, calculate specific probabilities, 
and then ask what can be said, with what degree of confidence, 
about future movements. 

In Chapters 5-7 we concentrate on the development of the spe- 
cific techniques necessary to appreciate the basic principles of sta- 
tistical analysis which will be considered later. In particular, we 
need to understand (1) bow to use counting techniques; (2) bow to 
calculate elementary probabilities, including binomial prob- 
abilities, and (3) bow to calculate expected values. 


COUNTING 


Counting techniques are important in solving probability problems. In this 
chapter we develop the concept of combinations, a counting procedure critical 
in later applications. Along the way we learn the multiplicative rule and the 
way to count permutations. 

Examples 5.1 and 5.2 illustrate the use of the multiplicative rule 


Erenpl 5.1 A company must choose one of eight applicants for a secretarial 
position, and one of six applicants for a janitorial position. In how 
many ways can this be done? 


Answer: We can pair up cach of the eight secretarial applicants 
with each of the six janitorial applicants for a total of 8 X 6 = 48 


ways. 
25. Bs 
A manufacturer must choose one of two warehouses, one of five 


assembly plants and one of three distribution centers. In how 
many ways can this be done? 


ll 


Answer: There are 2 X 5 = 10 ways of picking a warehouse and 
an assembly plant, and for each of these there are 3 ways to pick 
a distribution center, for a total of 10 X 3 = 30 ways. 


SE aa) ane 
on, ESS, 
= tet 


‘The mukiplicative rule can be stated in more general terms. 
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MULTIPLICATIVE RULE 


fone event can occur in m ways, and, for each of these ways, a second event 
can occur in m ways, then the two events can occur together in mn ways. The 
same principle can be extended to three or more events, as in Example 5.2. 

‘The multiplicative rule is the underlying principle behind various counting 
techniques. 


Tramp 53 A president and vice president are to be picked from six candi- 
——— dates. In how many ways can this be done? 


Answer: There are 6 choices for president and then 5 remaining 
choices for vice president for a total of 6 X 5 = 30 ways. 


ways ot ways of 


Sroncet 


Tzeng 54 Three bonuses, one for $10,000, one for $5000 and one for $2000, 
——— are to be given to three employees chosen from a group of ten 
In how many ways can this be done? 


Answer: There are 10 choices for the $10,000 bonus, 9 remaining 
‘choices for the $5000 bonus, and then 8 remaining choices for the 
$2000 bonus for a total of 10 X 9 X 8 = 720 ways. 


a 
— a 
= 
sie io 
ee ee 


‘The answers to Examples 5.3 and 5.4 can be expressed in terms of factorials 
Gl= 32% 1=6,5!=54X3X2% 1 = 120, etc.) as follows: 


6x5 = SXSXAX3X2K1 LB 


4x 3X2x1 a 
~ 1X9XBX7X6X5X4X3X2X1 _ 10 
WAKE & TK6XS5K4X3K2X1 2 


Note that 6 X 5 has 2 factors, 6 — 2 = 4, and 6 X 5 = 616 ~ 2)!. Also 
10 X 9 X 8 has 3 factors, 10 — 3 = 7, and 10 X 9 X 8 = 101/10 — 3)! 
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Similarly, if we have r factors beginning with m, we get 1/(n ~ Pil. This is the 
basis of the permutation rule. 


PERMUTATION RULE 


The number of ways of choosing r distinct objects from among 7 objects, 
where ‘order is important” is ré/(n ~ 1), Each of the different ways of making 
this choice is called a permutation, and the total number of such permutations 
is denoted by P(n,7). Thus, Pn,r) = n/n ~ Pil. We can see that “order is 
important” in Examples 5.3 and 5.4; for example, choosing Tom to be president 
and Mary to be vice president is different from choosing Mary to be president 
and Tom to be vice president. 


Tremp 55. For a special advertising campaign, a company plans to pick one 
Of its nine products for TV, one for radio, one for magazines, and 
‘one for newspapers. In how many ways can this be done? 


a cy 
TE Hake forked 


POA) = 


In situations where order is not important, we must modify the counting 
procedure, 


! 


In how many ways can a two person committee be chosen from 
six people? 


Answer: Compare this problem with Example 5.3. In that case, 
the answer was 6 5 = 30, but that answer is too lange for this 
‘example. While choosing Tom as president and Mary as vice 
president is different from choosing Mary as president and Tom 
as vice president, there is only one two-person committee consist- 
ing of Tom and Mary. Thus 30 is two times too large, and the 
correct answer is 30/2 = 15. 


ul 


In how many ways can three employees be chosen from a group: 
of ten to receive identical $4000 bonuses? 


Answer: Compare this problem with Example 5.4. In that case, 
the answer was 10 X 9 X 8 = 720, but again, that answer is too 
large for this example. There is only one way to give Jane, David, 
and Ann the same $4000 raise, but how many ways can we 
distribute three different raises among three people? The answer 
is 3 X 2X 1 because there are three choices for the big raise, two 
remaining choices for the middle raise, and only one remaining 
choice for whom to give the last raise, Thus 720 is 3X 2X 1 = 
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3! = 6 times too large for this example, and the correct answer is 
720/6 = 120. 


‘The answers to Examples 5.6 and 5.7 can be expressed in terms of factorials 
as follows: 6 X 5 has 2 factors, and we had to divide 6 X 5 = 61/4! by the 
number 2 or 2!. 10 X 9 X 8 has 3 factors, and we had to divide 10 x 9 x 8 


and 107! 


= 101/7! by the number 6 or 3!. Also, 61/4! divided by 2! is 
1 


6 
a 


divided by 31 is Similarly, if we divide n'/(m ~ 1)! by the number r!, 


the result is —" 
(= nin 


COMBINATION RULE 


The number of ways of choosing r distinct objects from m objects, where 
“order is not important,” is 'Y(n— 7)! r!, Each of the different ways of making 
this choice is called a combination, and the total number of such combinations 


is denoted by O(n,7) or (") Thus 
Se |) eee | 


Example 54 In how many ways can a company select four of its nine products 
~~ for an advertising campaign? (Note the difference from Example 
5.5) 


Answer: 


cos) = (:) * 


a ae 
4)" O= ara ~ 3a = 76 
For ease in future calculations, we note how to simplify factorial divisions. 


For instance, 


JO _ 10X9x8X7x@ 
@ @ 
 _ 8X7X6X5X4xXR 
x z 
When evaluating combinations, it is simplest to cancel the larger factorial in 
the denominator against part of the factorial in the numerator. 


=10X9X8x7 = 5040 


= 8X7X6X5X4 = 6720 


Example 5.9 
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We canceled the 4! in the denominator against part of the 7! in 
the numerator, and we were left with 3! = 3 X 2 in the 
denominator. Similarly 


12 12x 
CUZ) = 55 7 = 6 


rang) = BE = 019% 


is expected since there is only one way of choosing five items from a group 


of five when order is not important sina (4) = u()- 1, 610,10) = 1, 
6! 


etc, Another combination in which 0 ! arises has the form €(6,0) = 


(there is exactly one way to choose no item from a group of six.) 
Examples of other readily computed combinations are €(57.1) = S75 = 
57 (there are 57 ways to pick one item from a group of $7) and C3231) = 
32 (there are 32 ways to pick all but one item from a group of 32). Similarly, 
85,1) = 85 and 123,22) = 23, 
‘The results of other combinations are not as obvious and require some 
‘work. However, pattems can be noticed, For example, consider the following: 


-1 ()-« @)-5 @-< (> 
@)-1 ()-5 @- G-» (-5 B- 


‘These numbers form a pattern known as Pascal's triangle. 


If one line of Pascal's tangle is known, how can the next line be found? 
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Answer: The next line will stat’ and end with 1, Each other 
number can be found by adding the two numbers immediately 
above and slightly to the right and left. For example, the next line 
after the last one above is. 
1 145=6 5+10=15 10+10=20 10+5=15 5+1=6 1 


()-a0)=«()=(0)-»(i)-ol8)- 49) 
"). 


Finally, for later use, we comment on how to find (rhiewe know (- 


maettl s ) = (") (See Study Question 1) 


Erample 510. Given that 6) = 56, what is (i) 


15-1041 _ 6 
10 10 10 


EXERCISES 


1. In his novel Alaska, James Michener talks about the different factors that 
must combine to produce an ice age. Then, mathematically, he explains 
that “if four different factors in an intricate problem operate in cycles of 
13, 17, 23, and 37 years respectively, and if all have to coincide to produce 
the desired result, you might have to wait 188,071 years ....” Explain how 
Michener calculated 188,071. 

2. As reported in the Allentown Morning Call, Boston Chicken advertised that 
their customers had a choice of 3360 combinations of three side dishes 
from among their 16 side dishes offered. A high school teacher pointed 
out the mistake; can you? 

3. Johann Gregor Mendel characterized garden peas by the following seven 
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attributes: the stems are either long or short; the flowers are either red or 
white, and are either terminal or axial; the pods are either green or yellow, 
and are either constricted or inflated; and the seeds are either green or 
yellow, and are either smooth or wrinkled. How many different varieties 
Of garden peas are possible? 


Study Questions 


1. Show thar 2—£*1 


(2 jane for (7), and nore dat Mr 1)! = rland (n= r+ in- 
f=Qr— rt Dil 

2, Prove the formula (7) = ("7 ) + (7. }) which we used in Example 
5.10 to determine an entry in Pascal’s triangle from the two above entries. 
{Hlin: Write out the algebraic expressions for each of the two combinations 
on the right, and then convert to the common denominator: rn ~ 1) 

3. Suppose we have a set with » elements. Then the number of subsets that 
contain exactly r elements is C(n.7). What proportion of these subsets 
contain some designated element of the original set” (Hint: Reason that the 
answer is C(n ~ 1,r~ 1)/C(n.7), and show algebraically that this is equal 
worn 

4, a, What is the total number of subsets of a set with two elements? Of a 

set with three elements? 

b. Do you think that a set could have exactly ten subsets? Sixteen subsets? 


")-() (Hint: Write out the expressions for 


6 


PROBABILITY CONCEPTS 


‘The probability of a particular outcome of an experiment is a mathematical 
statement about the likelihood of that event occurring. Probabilities are alway 
between 0 and 1 with a probability close to 0 meaning that an event is unlikely 
to occur, and a probability close to 1 meaning that the event is likely to occur. 
‘The sum of the probabilities of all the separate outcomes of an experiment is, 
always 1. In this chapter we list the basic probability concepts that are needed 
for our discussion of probability distributions. 


‘The probability that an event will not occur, that is, the probability of its 
complement, is equal to 1 minus the probability that the event will occur: 


PO) =1- POD 


Tempe G1 If the probability that a company will win a contract is 3, what is 
—— the probability that it will not win the contract? 


Answer: 1 — 3 = 7. 


Addition principle 

If bwo events are mutually exclusive, that is, they cannot occur simultane- 
ously, then the probability that at least one event will occur is equal to the sum 
of the respective probabilities of the two events. That is: 


If PLXNY) = 0, then POXUY) = PUD + POY) 


where XMY, read *X intersect ¥," means that both X and Y occur, while XUY, 
read “X union ¥,” means that either X or ¥ or both occur. 


Eronple62 If the probabilities that Jane, Tom, and Mary will be chosen 

~~ chairperson of the board are .5, 3, and .2, respectively, then the 
probability that the chairperson will be either Jane or Mary is 
S+2= 7. 
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‘When two events are not mutually exclusive, then the sum of their prob- 
abilities counts their shared occurrence twice. This leads to the 


General addition formula 
For any pair of events Xand ¥, 


POXUY) = FOO + AY) - PANY) 


fl 


‘Suppose that the probability that a construction company will be 
awarded a certain contract is .25, the probability that it will be 
awarded a second contract is .21, and the probability that it will 
‘get both contracts is .13. What is the probability that the company 
will win at least one of the two contracts? 


Answer: 25 + 21 ~ 13 = 33 


principle 
If the chance that one event will happen is not influenced by whether or 


Trample 64 The probability that a student will receive a state grant is 1/3, 

—"—— while the probability that she will be awarded a federal grant is 
1/2. If whether or not she receives one grant is not influenced by 
whether or not she receives the other, what is the probability of 
her receiving both grants? 


Answer: 1/3 X 1/2 = /6 
‘The independence principle can be extended to more than two events; that 


is, given a sequence of independent events, the probability that all will happen 
is equal to the product of their individual probabilities. 


Example 65 Suppose a reputed psychic in an ESP experiment has called heads 
or tails correctly on ten successive tosses of a coin. What is the 


probability that guessing would yield this perfect score? 


Amswer: (1/2K1/2)..(1/2) = (1/2) = 1/1024 


In many applications, such as coin tossing, there are only to possible 
‘outcomes. For example, on each toss either the psychic guesses correctly or 
she guesses incorrectly, either a radio is defective or it is not defective, either 
the workers will go on strike or they will not walk out, either the manager's 
salary is above $50,000 or it does not exceed $50,000. In some applications 
such two-outcome situations are repeated many times. For example, we may 
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ask about the chances that at most one of four tires is defective, or that at least 
three of five unions will vote to go on strike, or that exactly two of three 
executives have salaries above $50,000. For applications in which a two-out- 
come situation is repeated some number of times, and the probability of each 
of the two outcomes remains the same for each repetition, the resulting 
calculations involve what are known as binomial probabilities. 


Tremp 66 Suppose the probability that a light bulb is defective is .1. What is 
——— the probability that four light bulbs are all defective? 


Answer: (AXIKDC1) = (1)* = 0001 


Eeample 67 Again suppose the probability that a light bulb is defective is .1. 
What is the probability that exactly two of three light bulbs are 
defective, 


Answer: We subdivide the problem as follows. The probability 
that the first two bulbs are defective, and the third is good, is 
(1AX.9) = ,009. (Note that, if the probability of being defective 
is .1, the probability of being good is .9.) The probability that the 
first bulb is good, and the other two are defective, is (9XDD 
= .009. Finally, the probability that the second bulb is good, and 
the other two are defective is (1.9X.1) = .009. Summing, we 
find that the probability that exactly two out of three bulbs are 
defective is 009 + .009 + .009 = .027. 


Example 68 If the probability of a defective light bulb is 1, what is the 
probability that exactly three out of eight light bulbs are defective? 


Answer: We can subxlivide the problem again. For example, the 
probability that the firs, third, and seventh bulbs are defective, 
and the rest are good, is 


(DEK IKI IKDC) = (1)5C9)% = .00059049 


‘The probability that the second, third, and fifth bulbs are defec- 
tive, and the rest are good, is 


CXL IDK IK = CIAO = 00059049 
As can be seen, the probability of any particular arrangement of 
three defective and five good bulbs is (.1)%.9)° = .00059049. How 


many such arrangements are there? In other words, in how many 
ways can we pick three of eight positions for the defective bulbs 
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(the remaining five positions will be for good bulbs)? From 
Chapter 5 the answer is given by combinations: 
& _ 8x7x6 
eas” 8a 
Each of these 56 arrangements has a probability of 00059049, 
‘Thus, the probability that exactly three out of eight light bulbs are 
defective is 56 X .00059049 = .03306744: 


= 56 


CRSKICS = FE PCOS =.03306744 


Erenple 65 Suppose 30% of the employees in a large factory are smokers. 
——— What is the probability that there will be exactly two smokers in 


a randomly chosen five-person work group? 


Answer: We reason as follows. The probability that a person 
smokes is 30% = .3, so the probability that he or she does not 
smoke is 1 — .3 = .7. The probability of a particular arrangement 
of two smokers and three nonsmokers is (3)4.7)° = .03087. The 
number of such arrangements is 15,2) = 51/213! = 10. Each 
such arrangement has probability .03087, so the final answer is 10 
X 03087 = 3087. 


y= 7) =. 
CG 2K3PC7Y = Fy (327) =.3087 


We can state the general principle as the binomial formula 


BINOMIAL FORMULA 


Suppose an experiment has two possible outcomes, called success and 
‘failure, with the probability of success equal to p and the probability of failure 
‘equal to q (of course, p + q = 1). Suppose further that the experiment is 
repeated 1 times, and the outcome at any particular time has no influence over 
the outcome at any other time. Then the probability of exactly xsuccesses (and 
thus =x failures) is 


Cin.) pf = ae pg 


A manager notes that there is a .125 probability that any employee 
will arrive late for work. What is the probability that exactly one 
person in a six-person department will arrive late? 


Answer: If the probability of being late is .125, then the prob- 
ability of being on time is 1 ~ .125 = .875. If one person of six is 
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late, then 6 — 1 = 5 will be on time. Thus the desired prob- 
ability is 


©(6,1.125)1.875)° = 6(.125.875)* = 385 


Many, perhaps most, applications of probability involve such phrases as at 
least, at most, less than, and more than. In these cases, solutions involve 
summing two or more cases. 
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‘A manufacturer has the following quality control check at the end 
of a production line: If at least eight of ten randomly picked 
anticles meet all specifications, the whole shipment is approved, 
If, in reality, 85% of a particular shipment meet all specifications, 
what is the probability that the shipment will make it through the 
control check? 


Answer: The probability of meeting specifications is .85, so the 
probability of not meeting specifications must be .15. We want the 
probability that at least eight of ten articles meet specifications, 
that is, the probability that exactly eight or exactly nine or exactly 
ten articles meet specifications. We sum the three binomial prob- 
abilities: 


racy of 10 Haactly $f 10 actly 10 of 10 
cet specifications meet speciation meet specications 


COOBX B5CISY + COO9K.85)°C15)" + CO010K.85)C15) 
= Ey caswcas? + 106859915)! + (859! 


= 820 


For the problem in Example 6.11, what is the probability that 
shipment in which only 70% of the articles meet specifications will 
make it through the control check? 


Answer: 


CAOBK.7A(3Y + CA09K.7(3) + CUOI0K.7C 3)" 
= 45(.7)8.3 + 10.7)9(3) + C79 = 383 


In some situations it is easier to calculate the probability of the complemen- 
tary event and subtract this value from 1. 
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Joe DiMaggio had a career batting average of 325. What was the 
probability that he would get at least one hit in five official times 
at ba 
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Answer: We could sum the probabilities of exactly one hit, two 
hits, three hits, four hits, and five hits. However, the complement 
of “at least one hit” is “zero hits.” The probability of no hit is 


(5.0K. 325.675)° = (.675)5 = 140, 


and thus the probability of at least one hit in five times at bat is 
1 — .140 = 860. 


A grocery store manager notes that 35% of the customers who buy 
particular product make use of a store coupon to receive a 
discount. If seven people purchase the product, what is the 
probability that fewer than four will use a coupon? 


Answer: In this situation, “fewer than four” means zero or one or 
‘two or three, 


CU70X.35)°.65)7 + COAX 35!(65¥ + C7.2K35)2.65)5 
+ COBK 35.65 
= 65)" + T3565) + 210.35)2(.65) + 35(.35)9.65)* = 800 


Sometimes we are asked to calculate the probability of each of the possible 
outcomes (the results should sum to 1). 


Grange 615 If the probability of a male birth is .51, what is the probability that 


4 five-child family will have all boys? Exactly four boys? Exactly 
three boys? Exactly two boys? Exactly one boy? All girls? 


Answer: 


POS boys) = C0555) 49 = C518 
PG boys) = C5.AX.51.49)! = 5(.51).49) 


PG boys) = C(5,3X5IX.49¥ = 100519.49¥ = 3185 
PQ boys) = C5.2KSIA49 = 10.51.49) = 3060 
PQ boy) = CS,.DCSD'C49 = 5-51K.49)" = 1470 
PCO boys) = C5,0X.51499 = (4a = _.0283 

‘7.0000 


A table such as the one in Example 6.15 shows the entire probability 
distribution, which in this case refers to a listing of the probabilities of all 


‘outcomes. 


For later use we need to know how to calculate the probability of x 
successes if we know the probability of x — 1 successes: 


Probability Concepts 57. 


Px successes) = Cin, x)p* g*-* 
PUx ~ 1 successes) = Cln,x ~ Dp 


In-(x-) = n- xt 


Note that P(x successes) has one more p and one less q than does 
P(x — 1 successes), so we should expect a factor p/q. Also, from 
Chapter 5 we have Ginx) = Mn — x + Wx lanx - 1) 
Combining these results gives 


+ 
SAVE px —1 successes) 
= 


Since q = 1 ~ p, this is often written 


PCx successes) = 


=xtl 


Px successes) = 
* 


pope 1 successes) 


Example 6.16 A sharpshooter can hit a bullseye target 95% of the time. Given 
that the probability of exactly eight bullseyes in ten shots is .0746, 
what is the probability of exactly nine bullseyes in ten shots? 


(0746) = 315, 


EXERCISES 


1. As reported in The New York Times (February 19, 1995), the Russian Health 
Ministry announced that one-quarter of the country’s hospitals had no sew- 
age system and one-seventh had no running water. What is the probability 
that a Russian hospital will have at least one of these problems if: 

a. the two problems are independent? 
>. hospitals with the running water problem are a subset of those with the 
sewage problem? 
2. In the November 27, 1994, issue of Parade magazine, the “Ask Marilyn” 
section contained this question: 
“Suppose a person was having two surgeries performed at the same 
time, If the chances of success for surgery A are 85%, and the chances 
of success for surgery B are 99%6, what are the chances that both would 
fail?” 

‘What do you think of Marilyn's solution: (.15X.10) = .015 or 1.5%? 

3. In November 1994, Intel announced that a “subtle flaw” in its Pentium chip 
‘would affect one in 9,000,000,000 division problems. Suppose a computer 
performed 20,000,000 divisions (a not unreasonable number) in the course 
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of a particular program. What is the probability of no error? Of at least one 
error? 

4. According to a CBS/New York Times poll taken in 1992, 15% of the public 
has responded to a telephone call-in poll. In a random group of five 
people, what is the probability that exactly two have responded to a callin 
poll? That at least two have responded? That at most two have responded? 

5. In a 1974 “Dear Abby” letter a woman lamented that she had just given 
birth to her eighth child, and all were girls! Her doctor had assured her 
that the chance of the eighth child being a girl was only one in 100. 

a, What was the real probability that the eighth child would be a girl? 
b. Before the birth of the first child, what was the probability that the 
woman would give birth to eight girls in a row? 

6. The yearly mortality rate for American men from prostate cancer has been 
constant for decades at about 25 of every 100,000 men. (This rate has not 
changed in spite of new diagnostic techniques and new treatments.) In a 
group of 100 American men, what is the probability that at least one will 
die from prostate cancer in a given year? 


Study Questions 

1, (Geometric distribution) Suppose the probability that someone will make 
4 major mistake on an income tax return is .08. One day, an IRS agent 
plans to audit as many returns as necessary until she finds one with a major 
mistake, What is the probability that a major mistake will be found on the 
first return? The second? The third? The fourth? The Afth? Note that, while 
the probability of a major mistake on any given return is always .08, the 
probability of which return the first major mistake is found on steadily 
decreases. The associated cumulative probability distribution is called a 
geometric distribution. Reason why, in this question, if the first three 
returns have no major mistake, the probability that more than five addi- 
tional returns will have to be audited to find a major mistake is the same 
as the probability at the beginning of the day that more than five returns 
will have to be audited. 


2. (Negative binomial distribution) During the 1990 World Series, the Oakland 
Athletics were heavily favored over the Cincinnati Reds. Suppose the prob- 
ability that Cincinnati would win any given game was 4/9. What was the 
probability that Cincinnati would win the series, that is, would win four 
games before Oakland won four games? [Hint The probabilities of separate 
cases must be summed: for each case, consider how many wins and losses 
occurred before the final game.] The above gives rise to what is called a 
negative binomial distribution. For a negative binomial distribution, as well 
as for the geometric distribution described in Study Question 1, the number 
of “successes” is fixed, while the total number of trials is allowed to vary. 
However, for the original binomial distribution, the number of “successes” is 
allowed to vary, while the number of trials is fixed. 


EXPECTED VALUES 


A bettor placing a chip on a number in roulette has a small chance of 
winning a lot and a large chance of losing a little. To determine whether the 
bet is “fair,” we must be able to calculate the “expected value” of the game to 
the bettor. An insurance company in any given year pays out a large sum of 
money to each of a relatively small number of people, while collecting a small 
sum of money from each of many other individuals. To determine policy 
Premiums an actuary must calculate the “expected values” relating to deaths 
in various age groups of policy holders. In this chapter we introduce the 
concept of a random tariable, and then show how to determine expected 
values or means and variances relating to this concept. In particular, we 
consider how to calculate means and standard deviations in the case of 
binomial probabilities. 


RANDOM VARIABLES 


In Chapter 6 we determined the probability for each outcome in various 
examples. Often each outcome has not only an associated probability, but also 
‘an associated real number. For example, the probability may be 1/2 that there 
are 5 defective batteries; the probability may be .01 that a company receives 
7 contracts; the probability may be .95 that 3 people recover from a disease. 
If X represents the different numbers associated with the potential outcomes 
of some chance situation, we call X a random variable. 


Example 71 A prison official knows that 1/2 of the inmates he admits stay only 

—— 1 day, 1/4 stay 2 days, 1/5 stay 3 days, and 1/20 stay 4 days before 
they are either released or are sent on to the county jail, If X 
represents the number of days, then X is a random variable that 
takes the value 1 with probability 1/2, the value 2 with probability 
1/4, the value 3 with probability 1/5, and the value 4 with 
probability 1/20. 


‘The random variable in Example 7.1 is called discrete because it can assume 
only a countable number of values. The random variable in Example 7.2, 
however, is said to be continuous because it can assume values associated with 
a whole line interval. 
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Trample 72 


Let X be a random variable whose values correspond to the 
speeds at which a jet plane can fly. The jet may be traveling at 
623.478 ... miles per hour or at any other speed in some whole 
interval. We might ask what the probability is that the plane is 
flying at between 300 and 400 miles per hour. 


A probability distribution for a discrete variable is a listing or formula giving 
the probability for each value of the random variable. 
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Concessionaires know that attendance at a football stadium will 
be 60,000 on a clear day, 45,000 if there is light snow, and 15,000 
if there is heavy snow. Furthermore, the probability of clear skies, 
light snow, or heavy snow on any particular day is 1/2, 1/3, and 
1/6 respectively. (Here we have a random variable X that takes 
the values 60,000, 45,000, and 15,000.) 


Outcome: clear skies light snow heavy snow 
Probability: v2 v3 v6 
Random variable: 60,000 45,000 15,000 


‘What average attendance should be expected for the season? 


Answer: We reason as follows. Suppose on, say, 12 game days 
we had clear skies 1/2 of the time (6 days), a light snow 1/3 of 
the time (4 days), and a heavy snow 1/6 of the time (2 days). Then 
the average attendance would be 


(6 X 60,000) + (4 x 45000) + (2X 15000) 47.500 


Note that we could have divided the 12 into each of the three 
terms in the numerator to obtain 


(& x e000) + ( x 45.000) + (2; x 15000) 


of, equivalently, 


(3 60000) +3 % 45000) + & x 15000) = 47500 
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Actually, there was no need to consider 12 days. We could have 
simply multiplied probabilities times corresponding attendances 
and summed the resulting products. 


EXPECTED VALUE (MEAN) OF A 
RANDOM VARIABLE 

The final calculation in Example 7.3 motivates our definition’ of expected 
value. The expected value (or average or mean) of a random variable X is the 
sum of the products obtained by multiplying each value x by the correspond- 
ing probability Pt: 


i 


kL 


EX) = Ex Po) 
Ina lottery, 10,000 tickets are sold at $1 each with a prize of $7500 
for one winner. What is the average result for each bettor? 


Answer: The actual winning payoff is $7499 because the winner 
paid $1 for a ticket, so we have: 


Outcome win lose 
A 1 9.959 

Probability 10,000 10,000 

Random variable 7499 -1 


raf —+_) 4 
tapeced vee = Tefal) *¢ 


‘Thus the average result for each person betting the lottery is a 25¢ 
loss. 


‘A manager must choose among three options. Option 4 has a 10% 
‘chance of resulting in a $250,000 gain, but otherwise will result in 
44 $10,000 loss. Option B has a 50% chance of gaining $40,000 and 
4 50% chance of losing $1000. Finally, option Chas a 5% chance 
of gaining $800,000, but otherwise will result in a loss of $20,000. 
Which option should the manager choose? 


The development bere i limited 10 random vanables with a finite number of values, but this 
definition can be euended somewhat using mine Sia. 
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Answer: Calculate the expected values of the three options: 


Option A Option B Option ¢ 
‘Outcome: gain loss gain — loss gin loss 
Probability: 10 90 50 50 05 95 


Random variable: 250,000 ~10,000 40,000 —2000 800,000 ~20,000 


ELA) = .10(250,000) + .90(~10,000) = $16,000 
LB) = .50K40,000) + 50-2000) = $19,000 
ELC) = .054800,000) + 95(~20,000) = $21,000 


‘The manager should choose option C! However, although Option Chas the 
greatest mean (expected value), the manager may well wish to consider the 
relative riskiness of the various options. If, for example, a $5000 loss would be 
disastrous for the company, the manager might well decide to choose option 
B with its maximum possible loss of $2000. (It should be intuitively clear that 
some concept of variance would be helpful here in measuring the risk; later 
in this chapter the variance of a random variable will be defined.) 


Suppose we have a binomial random variable, that is, a random variable 
whose values are the numbers of “successes” in some binomial probability 
distribution. 


Tremp 76 Of the automobiles produced in a particular plant, 40% had a 
certain defect. Suppose a company purchases five of these cars. 
‘What is the expected value for the number of cars with defects? 


Answer: We might guess that the average or mean or expected 
value is 40% of 5 = 4 X 5 = 2, but let’s calculate from the 
definition, Letting X represent the number of cars with the defect, 


we have: 

PO) = CEOKANLH = (698 07776 

PO) = C5,1X4)1.6) = 5CAX.6* = .25920 

PR) = CB2KAAGS = 1004.6) = 34560 

PR) = C53 4.6) = 10.4).6? = 23040 

PA) = C5AK ACO! = 54.6) = .07680 

PE) = CESKALOY = CA 01024 
Outcome: Oars ler Zar jor dc Sous 
Probability: 07776 25920 34560 23040 07680 01024 
Random variable: 0. 1 2 3 4 5 


HX) = 06.0776) + 1(.25920) + 2.34560)+ H.23040) + 41.07680) + 5(.01024) = 2 
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‘Thus, the answer tums out to be the same as would be obtained 
by simply multiplying the probability of “success” times the num- 
ber of cases. 


The following is true: If we have a binomial probability situation with the 
probability of success equal to p and the number of trials equal ton, the 
expected value or mean number of successes for the n trials is np. 

‘One proof of this statement involves algebraic manipulation with the 
binomial probability formula to show that SxP(x) = np. Another insight is as 
follows: 

If n= 1 we have: 


Outcome: success failure 
Probability: p Ip 
Random Variable (successes); 1 ° 


with expected value = K(p) + 01 — p) = p. For langer n, we consider each 
trial independently with a resulting expected value of p for each trial, The 
number of successes in 1 trials is the sum of the successes in all m trials, and 
so the expected value or mean of the trials is the sum of the expected values 
of all n trials. We calculate 


ptptpt..+p=mp 


One p foreach of m tas 


FJ An insurance salesperson is able to sell policies to 15% of the 
people she contacts. Suppose that she contacts 120 people during 
a 2-week period. What is the expected value for the number of 
policies she sells? 


Answer: We have a binomial probability with the probability of 
success .15 and the number of trials 120, so the mean or expected 
value for the number of successes is 120 X15 = 18. 


VARIANCE OF A RANDOM VARIABLE 


We have seen that the mean of a random variable is £xA.x). However, not 
only is the mean important, but also we would like to measure the variability 
for the values taken on by a random variable. Since we are dealing with chance 
events, the proper tool is variance. Variance was defined in Chapter 2 to be 
the mean average of the squared deviations (x — yx)? If we regard the (x)? 
terms as the values of some random variable (whose probability is the same 
as the probability of 2, then the mean of this new random variable is 


64 Probobility 


= Gp? PQ), which is precisely how we define the variance 0? of a random 
variable: 
of = Xx - mP P(x) 
‘As before, the standard deviation @ is the square root of the variance, 
78 A highway engineer knows that his crew can lay 5 miles of 


highway on a clear day, 2 miles on a rainy day, and only 1 mile 
ona snowy day. Suppose the probabilities are as follows: 


Outcome: clear rain snow 
Probability: 6 3 1 
Random Variable 

Aaniles of highway): 5 2 1 


‘What are the mean (expected value) and the variance? 
Answer: 
b= LaPx) = 566) +3) +1) = 37 
oF = Ex - pP PC) 
= G — 37¥C6) + (2 — 37P(3) + G — 37FCD) = 261 


Look again at Example 7.6. We calculated the mean to be 2, What 
is the variance? 


Answer: 0? = (0~2)4.07776) + (1~2)%.2592) + (2-2)%.3456) 
+ B-2)°C.2304) + (4-2)%.0768) + (5~2)%.01024) = 1.2 


Could we have calculated the above result more easily? In this case, we have 
inomial random variable, so we will use the same type of argument that we 
used in showing that p= mp. We first consider the binomial distribution for 
nek 


Outcome: success failure 
Probability: ra q 
Random variable (number of successes) 1 ° 


Now = 1) + 40) = p.so 


o = (1 — wep + © - pq 


= p+ Pq iu = pand1~p=q4 
= ptp+@ factoring] 
= Pq [p+q=Ul 


For larger n, we reason as follows. The number of successes is the sum of 
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the successes in all trials, so the variance is the sum of the variances from all 
trials (Chapter 2): 


q+ pqt pat. + pa = npq 


ne po fr ach of tle 


More formally, this result can be obtained through algebraic manipulations (see 
Study Question 3). 


Trap 7. 


How can we use this method to more simply calculate the 
variance in Example 7.9. 


Answer: npq = (AX.6) = 1.2. 
‘Thus, for a random variable X, 
Mean or expected value x = Ex P(x) 
Variance o? = E(x ~ p)? P(x) 
Standard deviation @ = YEG — pP PX) 


In the case of a binomial probability distribution with probability of success 
equal to p, and number of trials equal to n, if we let X be the number of 
successes in the 1 trials, the above equations become: 


Mean or expected value x = np 
Variance o? = pq = np - p) 


Standard deviation a = Ynpq = Ynpl— py 


Sixty percent of all new car buyers choose automatic transmis- 
sions. For a group of five new car buyers, calculate the mean and 
standard deviation for the number of buyers choosing automatics 


Answer: 
b= np = %6) = 30 
o = (mp — p) = VXOKA) = 
Note that this could have been calculated through the more 
involved 


b= Ex Pox) = OA)] + 1566.44] + A100664)3] 
+ M1069.4F] + 415.6449] + 5165] = 30 


o = (EG — per 
= J9.010241+ 407680] + 1123040] + 0134560] + 1125920] + 407776) 
=u 
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EXERCISES 


1, Alan Dershowitz, one of OJ. Simpson's lawyers, has stated that only one 
‘out of every 1000 abusive relationships ends in murder each year. If he is 
correct, and if there are approximately 1,500,000 abusive relationships in 
the United States, what is the expected value for the number of people 
who are killed each year by their abusive partners? 


2. 1f 3% of the population is allergic to the malaria fighting drug chloroquine, 
what is the expected value for the number of allergic people in a town of 
1200 people? 

3. A television game show has three payoffs with the following probabilities: 


Payoff ($); ——«1000—10,000 
Probability: 6 3 1 


‘What are the mean and standard deviation for the payoff variable? 


4 Companies proved to have violated pollution laws are being fined various 
amounts with the following probabilities: 


Fine (8) 1000 10,000 $0,000 100,000 
Probability 4 3 2 1 


What are the mean and standard deviation for the fine variable? 


5. The New York Times (September 21, 1994) reported that an American 
woman diagnosed with ovarian cancer has a 37.5% chance of survival, and 
that approximately 20,000 American women are diagnosed with ovarian 
cancer each year. 

a. What is the expected number of deaths annually from this disease? 
b. What are the mean, variance, and standard deviation for a binomial 
with 1 = 20,000 and p = .625? 

6. Of the coral reef species in the Great Barrier Reef off Australia, 73% are 
poisonous. If a tourist boat bringing divers to different points off the reef 
encounters an average of 25 coral reef species, what are the mean and 
standard deviation for the expected number of poisonous species seen? 


Study Questions 

1. A famous problem asks you to choose between two envelopes, one of 
which has twice as much money as the other. You arbitrarily pick one, 
open it, and find $100. You are then given the chance to switch envelopes. 
You reason that the other envelope has either $50 or $200, each with a 5 
probability. Applying your understanding of expected value, you calculate 
5($50)+.5($200) = $125 and conclude that you should switch envelopes. 
Comment on this reasoning. 
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2, Show algebraically that in the case of a binomial distribution, the formula 
= Lx Aad reduces to np. Hint 


ka — pyr + pa peasdl Ppa - pro] 
+ feces pia- py] tot nlp) 

= prot eon Da ~ prep + BMAD psp 
tect pel = mpl — p) + plot = 


3, Show algebraically that, in the case of a binomial distribution with = 2, 
the formula o* = E(x — pw)? Ax) reduces to np ~ p). [Hint Use the result 
from Study Question 2 that w= mp) How about a binomial distribution 
with n= 3 


Part 3 


PROBABILITY DISTRIBUTIONS 


With an understanding of binomial probabilities and expected 
values, we can now develop and explore several probability distri- 
butions of general interest in statistics. Such knowledge not only is 
necessary for our future study, but also is immediately useful as a 
decision making tool for certain classes of problems. 

For example, knowing the probability of finding oil given certain 
geological conditions, we can use the binomial distribution to cal- 
culate the probabilities of various numbers of positive strikes for a 
given number of test sites. Knowing the average number of Supreme 
Gourt vacancies during previous presidential terms, we can use the 
Poisson distribution to calculate the probabilities of various num- 
bers of vacancies arising during the next 4-year term. Knowing the 
mean and variance of beights of US. Marines, we can use the 
normal distribution to calculate the probability that any Marine bas 
a beight greater than a specified value. 

Our discussion in Chapter 8 of the properties of the binomial 
distribution arises from the concepts and calculations of Chapters 
6 and 7. The Poisson distribution in Chapter 9 can be viewed as a 
limiting case of the binomial when n is large and p is small, and 
the normal distribution in Chapter 10 as a limiting case of the 
binomial when p is constant but n increases without bound. With 
this approach, it will be clear that both the Poisson and the normal 
distributions can be used as approximations to binomial problems. 
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Finally, it cannot be overstressed that, even if there were very few 
naturally occurring normal distributions, the normal bas tremen- 
dous value in that it describes the distribution found in a wide 
range of statistical experiments, investigations, and studies. This 
application will be the focus of Parts 4 and 5. 
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BINOMIAL DISTRIBUTION: 


Suppose an experiment or situation has a finite number of possible out- 
comes. Each outcome has an associated probability, and there are various 
ways to display these probabilities. Two primary ways are tables and histo- 
grams, 


Greg KT A switchboard operator notes that for any I-minute period, the 
——— probabilities for no, one, two, and three calls are .3, 4, 2, and .1, 
respectively. We display these probabilities as follows: 


IW ATABLE and IN A HISTOGRAM: 
umber of Calls Probability —-«_Probabiity is relative oreo 


The set of probabilities (3, 4, .2, .1) in Example 8.1 is called a probability 
distribution. Over a large number of L-minute periods, we would expect no 
calls in approximately 30% of the periods, one call in 40%, two calls in 20%, 
and three calls in 10%. The larger the number of I-minute periods considered, 
the closer we would expect the observed percentage to be to these theoretical 
percentages. 

In this chapter we consider probability distributions arising from binomial 
probabilities. The concept of binomial probability developed in Chapter 6 
can be expanded in various directions. We will consider tables, histograms, 
and the notions of mean and variance, and will then look at several appli- 
cations. 


n 
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To review, a binomial experiment has the following characteristics: 


1, The experiment consists of m identical tials, 

2. Each trial has the same two outcomes, commonly called success and 
‘Sailure, with probabilities p and 1 ~ p, respectively. 

3. The probabilities of success and failure remain the same from trial to 
trial; that is, the outcome of any one trial has no effect on the outcome 
of any other trial 

4. The mean is p= mp. 

5. The standard deviation is o = Vnpil — p) 


‘The probability is 1/3 that a supermarket customer will spend over 
$50, Suppose that there are four customers at a checkout counter, 
What is the probability distribution for the number of these 
customers spending over $50? 


! 


Answer: 


PCO) = CU4,0K1/39X2/3" = 102/39 
POD = CU4,1X1/3)2/3) = 41/32/38 
PQ) = C4.2K1/3P2/3F = 60/3423" 
PR) = CU4SKA/B23)! = 4/3)N2/3) 
PAA) = C4 AKA/3)2/3) = 11/3) 


SRS 


Probability is relative 


oa 


‘Comer spe ve 380 


‘The mean and standard deviation of the probability distribution in Example 
82 are as follows: 


n= m= co) = 1333 o = WP = (3) =943 
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‘The probability is .6 that a well driller will find water at a depth 
of less than 100 feet in a certain area. Wells are to be drilled for 
six new home owners. What is the complete probability distribu- 
tion for the number of wells under 100 fee? 


Answer: Using the appropriate line (1-6-15-20-15-6-1) from Pas- 
cals triangle (Chapter 5), we have: 


FO) = 1.6)%.4 = 004 
RU) = 6.6)'(4)' = 037 
FQ) = 15.644) = 138 
FR) = 20.6 = 276 
Ra) = 15.64) = 311 
PS) = 66)%.4)! = 187 
FO) = 6AP = 047 

7.000 


Wels under 
100 feet Probebiity 


3 


+ T T 
° 1 2 3 4 5 6 


‘Wells under 100 feet 


‘The mean and standard deviation of this probability distribution are calcu- 
lated as follows: 


B= mp = 66)= 36 0 = np — p) = VKGXA = 12 
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Itis useful t0 also label the horizontal axis with >scores (see Chapter 4). In 
this case, zscores of 0, 1, and 2 correspond to 3.6, 48, and 6.0, while =scores 
of =1, -2, and 3 correspond to 2.4, 1.2, and 0, respectively. 


Example 84 A baseball player with a batting average of .250 has 12 official 
at-bats in a three-game series. What is the probability distribution 
for the number of hits he makes? Display in a histogram. 


Answer: The probability of a hit is 250, and we have 


PO) = C12,0%.250%.750) = 1075) = 032 
POD) = C112,1%.250) 750)!" = 12.25.75)!" = 127 
PQ) = C12,2K.25071.750)!” = 66.25.75)" = .232 
PR) = CU12,3X.250)4.750" = 2206.25).75)" = 258 
PAA) = CU2,AX.250)C 750)" = 495(.25).75)" = 194 


PG) = CU12,5X.250)%.750)" = 7926.25)%.75)" = 103, 
FG) = C12,6X.250).750%" = 9240.25)°.75)" = .040 
PO) = C112,7.250)7.750)° = 7926.25)7.75)° = 012 
PRB) = CU2,8X.250)%.750)" = 495(.25).75)* = 002 


To three decimal places A9) = P10) = P11) = P12) = 0. The 
mean number of hits is 


B= np = 12.25) = 3 
and the standard deviation is 
o = (np — p) = VIX2SKID = 15 


Thus, zscores of 0, 1, and 2 correspond to 3, 4.5, and 6, while 
scores of —1 and ~2 correspond to 1.5 and 0, respectively. 
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6 Number of its 


Note how the value of p affects the shape of the distribution as displayed 
in a histogram. For n = 10, the following are the histograms for p = 5, 
b= A,and p= 7. 


For p = .5 the distribution is symmetric with the greatest probability occurring 
at the center. For p = .7 the high point of the histogram is shifted to the right, 
and for p = .1 it is shifted even farther to the left. 

The following examples and exercises illustrate direct applications of the 
binomial distribution. 


Example 33 A common decision-making problem arising in manufacturing 
——— situations involves whether to accept a shipment of raw materials 
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or of finished products. Concerns over quality control are some- 
times handled by inspecting a sample before accepting a whole 
shipment. The shipment is accepted only if fewer than some 
specified number of defects are found in the sample. Decisions on 
sample size and allowable number of defects are specific to the 
given situation and must take into consideration time, money, 
quality needs, and other relevant factors. 


In analyzing a proposed sampling plan, it is important to calculate 
the probability of shipment acceptance given various possible 
levels of product defect. For example, suppose the decision- 
making rule is to pick a sample of size m = 10, and accept the 
whole shipment if the sample contains at most one defective item, 
What is the probability of acceptance if the defect level of the 
shipment is actually 5%? 10%2 20% 30%? 4096? 50%? 


Answer: 
For p = 05, 
P (accept) = PCO def) + PU def) 
= €(10,0%.05)°L 95) + €(10,1.05)1( 95)" 
‘Thus we have: 


For p= .05, accepd) = (.95)! + 10.05.95)” = .914 
For p= .10, accept) = (.90)! + 106.1090)? = .736 
For p = .20, accept) = (.80)" + 10(.20X.80)° = .376 
For p = 30, Maccepd) = (.70)!° + 10(.30X.70 = 149, 
For p = 40, accept) = (60) + 10(40X.60)° = .046 
For p = .50, accept) = (.50)" + 10(.50%.50)” = .011 


‘The graph of the above values is called the operating charac- 
teristic curve for this sampling plan. 


10 
Probability 

of Ss 
acceptance 
P (accept) 


Note that, 
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‘p= 0, thats, if there is no defective item in the entire shipment, 


then Placcept) = 1. On the other hand, if p = 1.0, thats, ifthe entire shipment 
is defective, then accept) = 0. 


Example 8.6 


Number of new employees who passed: 0 1 203° 4 
Number of firms: 2.6 13 2 2% 16 1 


‘We can test the randomness of certain happenings by observing 
how closely the results follow an expected binomial pattem. For 
example, suppose there are 100 accounting firms each with 10 
new employees. We want to test the hypothesis that the new 
‘employees joined the firms randomly, and for data we have results 
from the CPA exam taken by all the new employees. Suppose that 
in 2 firms no new employee passed the exam, in 6 firms one new 
‘employee passed, in 13 firms two new employees passed, in 20 
three passed, in 28 four passed, in 16 five passed, in 11 six passed, 
in 4 seven passed, and no firms have more than seven new 
‘employees who passed the CPA exam, 


0 


56789 
4000 


8 
o 


Would the above results be likely if the new employees joined the 
various firms randomly. 


Answer: If these new employees were distributed at random 
among the firms, what would be the distribution with regard to 
number of new employees who passed the CPA exam? First, we 
determine the probability, p, of passing the exam by summing the 
‘number of the 1000 new employees who passed and then dividing 
by 1000. 


2X F GY + OSs XH + HHH) + EK FXO FD 


<3" 


700) 


Now, what is the probability that a firm had none of its ten new 
employees pass? One new employee pass? Two out of ten pass? 
‘Three? Four? We recognize this question as a binomial with p = 
378 and n = 10, Using the appropriate line from Pascal's triangle, 
1-10-45-120-210-252-210-120-45-10-1, we calculate: 


PO) (622) 

PA) = 10.378X.622 = J 
FQ) = 45(378)X.622F | = 145 
PGR) = 120(378)%.622)" 

FA) = 210(378)%.622"° 

FS) = 252(.378)%.622)* 

FG) = 210(.378).622)" 


78 Probability Distributions 


FT) = 120(.378)7(.622) = .032 
FB) = 45(.378).622)? | = .007 
9) = 10(378)%.622) = 001 
FAO) (378) = .000 


Multiplying each of these probabilities by 100 (there are 100 firms) 
gives the “expected” number of firms with 0 employees passing, 
‘one employee passing, two employees passing, and so on. 


Number of new employees who passed: 0.12 3 4 5 6 7.89 10 
Expected number of firms: 15 15 232 1 93100 
Actual number of firms: 26 13 20 2 16 11400 0 


Are the “expected” and “actual” numbers close enough to jus 
the conclusion that we have a binomial distribution, and thus that 
the new employees who passed the CPA exam are randomly 
distributed among firms? The numbers do appear to be very close, 
and we probably would conclude that the variations in exam 
results between firms were due to rindom influences, 


In Chapter 19 on chi-square analysis for goodness-of-fit tests, we will discuss 
‘@ mathematical technique for measuring the strength of agreement between 
two such sets of numbers. Meanwhile, the type of analysis illustrated in 
Example 8.6 is standard and useful. A similar angument in the next chapter will 
tuse the Poisson distribution instead of the binomial. 


EXERCISES 


1. It has been shown that in major league baseball games, home teams win 
approximately 35% of the time. Suppose a team plays a three game home 
series. What is the probability distribution for the number of wins? 

2. Suppose USAir accounts for 20% of U.S. domestic flights. 

a. If there are seven major air disasters in the United States, what is the 
probability distribution for the number of these disasters for which 
USAir might be accountable? 

b. What is the probability that USAir accounts for at least four of the 
disasters? (USAir actually did account for four of the last seven major 
disasters. “That's enough to begin getting suspicious but not enough to 
hang them,” said Dr. Brad Efton in The New York Times, September 11, 
1994.) 

3. A study reported by Liu and Schutz states that in hockey the better team 
has a 65% chance of scoring the first goal in overtime. 


escoui Danser ” 


a. If a team is considered to be the better team in all five of its overtime 
‘games, find the probability distribution for the number of these games 
in which that team will score the first overtime goal. 

b. Find the mean and standard deviation for this distribution. 

¢. Draw a histogram showing two horizontal axes, one for number of first 
overtime goals, and one for z-scores. 

4. The New York Times, June 21, 1993, reported that one out of every three 
people in the world is infected with tuberculosis (although most cases are 
latent. 

a. For a group of ten people, find the probability distribution for the 
number who may have tuberculosis. 

bb Find the mean and standard deviation for this distribution. 

¢. Draw a histogram showing two horizontal axes, one for number of 
people with tuberculosis, and one for z-scores. 

5. An inspection procedure at a manufacturing plant involves picking three 
items at random and then accepting the whole lot if at least two of the 
three items are in perfect condition. If in reality 90% of the whole lot are 
perfect, what is the probability that the lot will be accepted? If 75% of the 
whole lot are perfect? If 60% are perfect? 


Study Questions 
1. Seeds of a new variety are planted in 50 rows with eight seeds to a row. 
‘The number of germinations per row is noted, with the following results 


Number of germination: 0.12345 6 78 
Actual number of rows: 0.0125 8 18 12 5 
How many of the 400 seeds germinated? 
. What is the probability that a seed germinates? 
¢. What is the binomial probability distribution with this probability and 
with 1 = 8 
d. Mukiplying each probability in this distribution by 50 gives how many 
expected numbers of rows for each number of germination’? 
€. Do the expected numbers scem to be close enough to the actual 
numbers that we can conclude the seeds were randomly distributed 
among the rows? 
2. A new antibiotic is tested on six patients in each of 125 hospitals. The 
number of positive responses among the patients in each hospital is noted 
with the following results: 


ve 


a. How many patients had a positive response? 
b. What is the probability that a patient will have a positive response? 
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- What is the binomial probability distribution with this probability and 
with n = 6 

|, What is the expected number of hospitals for each number of positive 
responses? 

. Do the expected and actual numbers of hospitals seem to be close? (If 
the actual numbers are close to the numbers predicted by the binomial 
distribution, we will be more confident in concluding that the results 
are due solely to the antibiotic and not to differences in the hospitals 
and other factors.) 
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POISSON DISTRIBUTIONS 


To calculate a binomial distribution, we must know 1, the number of cases 
under consideration. In other words, to find the probability of a certain number 
Of successes, we must also know the corresponding number of failures, 
However, there are situations in which the number of failures cannot be 
determined. For example, in calculating the probability that a hospital will have 
two appendectomy cases in 1 day, it makes little sense to ask how many 
appendicitis cases will not occur. In calculating the probability that a baseball 
team will score five runs in a game, it is meaningless to ask how many runs 
will not be scored. And in finding the probability that there will be ten 
incoming telephone calls at a switchboard, it is impossible to say how many 
calls will not come in. 

In a wide variety of examples, both n, the number of cases, and p, the 
probability of success, are unknown. In this chapter we consider the special 
cases in which 1 is very lange, and p is very small, but the mean or average 
4 = np is both moderate and known. For example, a lange number 1 of planes 
fly into a major airport, and the probability p that any particular plane will be 
‘on the runway at any specific moment in time is small, however the average 
number np of planes waiting in line on the runway at noon each day can be 
determined without evaluating either p or 7. Similarly, a large number n of 
machine pans could pass through distribution center each day, and the 
probability p that any particular item would be defective might be small; 
however, the average number np of defective items found each day could be 
noted without knowing either p or n. For such situations we develop a 
technique for calculating the probabilities of various numbers of “successes.” 


PROBABILITY OF ZERO SUCCESSES 


First we consider the problem of determining the probability of no suc- 
cesses, We have seen that for a binomial experiment with 1 cases and 
probability of success p, the probability of one failure is 1 — p, and the 
probability of m consecutive failures is (1 — p)". We introduce the mean j= 
np algebraically by noting that 


1 1 
n=Lipm=1y sm a- pr =a prem = [a — pr 
ppm = 5H PD Pi [a- pot 


What can be said about (1 — p)"# in our special case concerned with a small 
value of p? 
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A hand calculator gives the following resuls: 


P 5 1 001.0001 0000000001 
(=p; 25000 348683677036 788 36788 


Mathematically, it can be shown that, as p is chosen to be closer and closer to 
0, =p)" is closer and closer to 1/e, where e is a mathematical constant that 
arises in a wide variety of mathematical fields and applications. 


36788... 


e= 27182... and + 
e 
“Thus, for small p, the probability of no successes, 


[a - pep 
is approximately (1/e)* or &# 


A supermarket manager knows that an average of four persons 
will pass through the checkout line during any 10 minute period. 
a, What is the probability of no customer in a 10 minute period? 
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Answer: © = 018 


'b, What is the probability of no customer in a five-minute period? 
In a 1-minute period? 


Answer: If the average in 10 minutes is four persons, then the 
average in 5 minutes is 4/2 = 2 persons, and the average in 
1 minute is 4/10 = 0.4 people. The probability of no customer in 
a S-minute period is thus e? = .135. In a I-minute period the 
probability is e-°4 = .670. 


‘A national highway study claims that an average of 1.7 accidents 
‘occur for every 100-mile length of interstate highway every year, 
a. What is the probability of no accident on any 100-mile stretch? 


Answer: &!” = 183 


b. What is the probability of no accident in a 200-mile stretch of 
interstate highway? In a 10-mile stretch? 


Answer: In a 200-mile stretch the yearly average will be 201.7) = 
3.4 accidents, so the probability of no accident is e~*4 = 033. For 
a 10-mile stretch the average is 1.7/10 = 0.17 accident, and so the 
probability of no accident is e°"" = 844. 
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PROBABILITIES OF POSITIVE NUMBERS 
OF SUCCESSES 


The key to proceeding from the probability of no success to the probabilities 
of one or more successes is the binomial formula from Chapter 6: 


Px successes) = ree oth ~ 1 successes) 


In our case p is small so 1 ~ pis approximately equal to 1, and 7 is lange so 
n~x+1 is approximately equal to n. In this situation 


and our formula reduces to 


PCx successes) = 


Combining this formula with the derivation for the probability of no success 
leads to the following: 


PQ successes) = 


success) = # enw 
Pa y= 8 


cscs) = H[ten] 0 
PQ successes) 4[ ] 


2 
successes) = #[M on] a Ben 
mssucenes = t[tten]= He 


PU successes) = 


and, more generally, 


Trengle 93 In a large northeastem town there is an average of 1.5 business 
bankruptcies per week. What is the probability of no bankruptcy 
in a week? One bankruptcy? Two bankruptcies? Three? 


Answer: There are a large number of businesses, the probability 
of bankruptcy is small, and the average is known. Therefore, we 
apply the formula we developed to obtain: 
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P(O bankruptcies) = e-15 = 2B 
PUlbankruptcy) = 15e-* = 335 
P(2bankruptcies) = [(15#/2le"5 = 251 
PG bankruptcies) = [(15)8/3!Je-5 = 126 


Tremp 94 A grocery store manager knows that his store sells an average of 
three cans of artichoke hearts per week. Assuming that this 
situation is described by the formula developed above, what is the 
probability that no can of artichoke hearts will be sold in 1 week? 
‘The probability of one can in 1 week? Two cans in 1 week? One 
can in 1/2 week? Three cans in 2 weeks? 


Answer: 

For 1 week the average is three cans, so 
P(Ocans in 1 week) = e~> = 050 
P(.can in week) = 3e"3 = 49 
P(2cansin 1 week) = (32/203 = 224 


For 1/2 week the average is.5(3) = 1.5, 50 
PC1can in Y2week) = 15e15 = 335 
For 2 weeks the average is 2(3) 


PG cans in 2 weeks) = (64/3) 


POISSON PROBABILITY DISTRIBUTION 
A probability distribution given by 
Pex successes) = Hee 


where is the mean and e = 2.71828... is called a Poisson probability 
distribution. We have seen that a Poisson distribution will result as the limiting 
case of 2 binomial distribution for large m, small p, and moderate ju. The 
Poisson is often the appropriate distribution description in situations in which 
the probability of one occurrence during a short interval is proportional to the 
Jength of that interval, 


Example 95 One of the first noted examples of a Poisson distribution involves 
——— the umber of deaths from horse kicks in various corps of the 
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German army in the late 1800s. More specifically, during the 
20-year period from 1875 to 1894, among the 14 cavalry corps of 
the German army, there was an average 0.7 deaths per corps per 
year as the result of horse kicks. If the deaths followed a Poisson. 
distribution, the descriptive probabilities per corp per year would 


be as follows: 
PCO deaths) = e~*7 
PA death) = 07e-«7 
PQ deaths) = (07)%/2le-®7 
PG deaths) = ((0.7)5/3!le"°7 
POA deaths) = (07)*/4 le-®? = .005 


1000 
If these are the correct probabilities, then the expected numerical 


distribution, found by multiplying the probabilities by 280 (20 
years X 14 corps = 280 groups), is as follows: 


Predicted Nomber of Groups 
497 X 280 = 139.2 
348 X 280 = 97.4 
122 x 280= 34.2 
028 X 280. 7.8 
005 280 


° 
1 
2 
3 
4 


14 


E-7) 


How accurate is this description? The actual figures were as 
follows: 


‘Actual Nomber of Groops 


4 
on 
32 
n 
2 


How close are the actual and predicted results? In Chapter 20, we will 
develop a measurement for “closeness of fit.” 


Example 96 During a recent leap year, an average of 3.26 new births per day 
—— occurred in Tompkins Community Hospital in Ithaca, New York. 
If the distribution were Poisson, the probabilities would be 
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oo 
3.265% 
(267/213 
(2673 
(2074 
2075 
(3.2676 Yes 
(G26777 Ne 
(3.26078 er3* 
1.26979 ters 
13.26"/10e* 


0 
1 
2 
3 
4 
5 
6 
7 
8 
9 
0 


On the basis of these probabilities we have 


z 
Fi 


Expected Number of Occurrences 


038 366 = 14 
125 x 366 = 46 
204 X 366 = 75 
222 x 366 = 81 
IB] X 366 = 66 
118 x 366 = 43 
064 X 366 = 23 
030 x 366 = 11 
O12 x 6 = 4 
004 x 366 = 2 
002 x 366= 1 


Seavansunno 


How accurate is this description? The actual figures obtained from 
working through hospital records are as follows: 


Births on 1 doy ‘Actool Number of Occrrences 
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Trample 97 The Poisson is used in the following type of hypothesis test 
Suppose that observations taken before a company began to 
dump pollutants into a river indicated that an average of three 
trout per hour swim past the dump site. An inspector plans to 
spend an hour at the site, and will issue a warning if she sees 
fewer than three trout 


a. If the pollutants have no effect on the trout, so the mean per 
hour is sill three, what is the probability that a warning will be 
issued? 


Answer: 


Alless than 3) = FO) + A) + 2) = &°3 + 303 + 4/29 = 423 


b. Ifthe pollutants kill haf of the trout, what is the probability that 
a warning will of will not result? 


Answer: In this cas 


the mean is .5(3) = 1.5, and thus we have 


Powarning) 


eS + 15e15 + [1522/2le“5 = 809, 


P(no warning) 


309 = 191 


Note that, with this inspection procedure, there still might be a warning 
(423 probability) even if the pollutant is harmless, and there might be no 
‘warning (.191 probability) even if the pollutant kills half the fish, Different 
testing procedures will give different probabilities. Many factors go into deter- 
mining what testing procedure is used, and mathematical calculations such as 
those shown here are among the most important. 


POISSON APPROXIMATION TO 
THE BINOMIAL 


In this chapter we used the binomial to derive the formula for the Poisson 
distribution. Altematively, an important application of the Poisson is as an 
‘approximation to the binomial. The Poisson approximation is of course 
valuable when pand mare not known, but the mean np is known. However, 
the Poisson approximation is useful even when p and 1 are known, because 
the calculation of a Poisson probability is simpler than the calculation of the 
corresponding binomial probability. A rough rule of thumb is that the resulting 
approximation is “close” provided that 
n=20 and p=o05 


Of course, how close is close enough, will depend on the degree of 
accuracy required in the particular application. 
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Example 98 In a binomial distribution with p = .02 and n = 70, what is the 
actual probability of three successes, and what is the Poisson 
approximation? 


Answer: We calculate this probability in two ways: 


Binomial: The probability of success is 02, so the probability of 
failure is 98. Thus we have 


029598)" = 1131 


70! 
PO successes) = 5 


Poisson: The mean is 70(.02) = 1.4, and so 


= 1128 


Note how much easier the Poisson calculation is than the binomial cal- 
culation, 


Trompe 99 1f 496 of the population is colorblind, what is the probability that 
in a group of 30 people, exactly one will be color-blind? At least 
‘one will be color-blind? 


Answer: To use the Poisson approximation to thi 
first calculate the mean to be 30.04) = 1.2. Then 


binomial, we 


PC success) = 12e7!2 = 361 


Pat least 1 success) = 1 — P(O successes) = 1 — e-!2 = 699 


A delivery of ten items is received from a manufacturing plant at 
which 5% of the items produced are known to be defective, What 
is the probability of no defective items among the ten items? Of 
exactly one defective item? 


Answer: Since n = 10 is not large, we do not expect to obtain a 
very good approximation to the normal using the Poisson (how- 

ver, the result is surprisingly close). The mean is 10(.05) = 0.5. 
Then 


P(O defectives) = e-°5 = 607 (Actual answer is 599. ) 
P( defective) = 05e-°5 = 303 (Actual answer is 315.) 
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HISTOGRAMS 


Finally, we obtain further insight into the shape of a Poisson distribution by 
looking at histograms for several different values of the mean 4. 


a | 
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EXERCISES 


In each of the following exercises, assume that the distribution is Poisson 

1. According to the Los Angeles Times, during the past 5 years 1.6 elephant 
trainers, on average, were killed on the job in the United States every year. 
What is the probability of no death of an elephant trainer in 1 year? Of no 
deaths in a 2-year period? Of no deaths in 5 years? 

2. The New York Times reports that large meteorites strike our atmosphere 
with the intensity of atomic bombs an average of eight times a year. What 
is the probability of no such meteorite strike in 1 year? Of five strikes in 1 
year? Of eight in 1 year? 

3. Angry executives of a company claim that the telephone switchboard 
operator makes an average of two wrong connections every hour, while 
the operator claims that he makes only one per hour. The manager of the 
secretarial staff decides to nun a 2-hour test. If the operator makes four or 
more wrong connections in the 2 hours, he will be replaced. What is the 
probability that he will be replaced if the executives are correct? If he is 
correct? 

4, A trucking company plans to purchase a new tire for its trucks. The owners 
decide to run a preliminary test using the new tires on a small number of 
their fleet of trucks. If there are no more than three flat tires in an initial 
100,000 miles, the new tire will be accepted. What is the probability of 
acceptance if the tires actually average one flat per 50,000 miles? If they 
average one flat per 10,000 miles? 

5. There were 231 deaths at an upstate New York hospital during a recent 
leap year. This is a binomial, but is large and unknown while p is small 
and unknown, so we can use the Poisson as an approximation. Mathe- 
matically, on how many days would we have expected no deaths? One 
death? Two deaths Three deaths? Four deaths? (The actual numbers were 
surprisingly close: 201 days with no deaths, 118 days with one death, 33 
days with two deaths, 10 days with three deaths, 3 days with 4 deaths, and 
1 day with five deaths.) 


Study Question 
Regarding the Poisson as a limiting case of the binomial, determine the 
variance of a Poisson distribution with mean ys. [Hint: What is the variance 
of a binomial distribution in terms of p, the probability of success, and 1, 
the number of cases? What does the variance approach as 1 becomes large 
and p becomes small? What is the answer in terms of w= nf2] 
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NORMAL DISTRIBUTIONS 


Some of the most useful probability distributions are symmetrical and 
bell-shaped: 


In this chapter we study one such distribution called the normal distribu- 
tion. The normal distribution is valuable in describing various natural phenom- 
ena, especially those involving growth or decay. However, the real importance 
of the normal distribution in statistics is that it can be used to describe the 
results of sampling procedures. 

‘The normal distribution, like the Poisson, can be viewed as a limiting case 
of the binomial. More specifically, we start with any fixed probability of success 
‘P.and consider what happens as n becomes arbitrarily large. To obtain a visual 
representation of the limit, we draw histograms using an increasingly large 1 
for each fixed p. 

Histograms for p = .5 and for n 


|, 10, and 25, respectively. 


Histograms for p = .1 and for n = 4, 10, and 25, respectively. 
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Histograms for p = .7 and for n = 4, 10, and 25, respectively. 


aN 


For ou 


For greater clarity, different scales have been used for the histograms associ- 
ated with different 7's, From the diagrams above itis reasonable to accept that as 
n becomes larger without bound, the resulting histograms approach a smooth, 
bell-shaped curve. This is the curve associated with the normal distribution. 

The normal curve is bell-shaped and symmetrical with an infinite base, 
Long, flat-looking tails cover many values but only a small proportion of the 
area, The flat appearance of the tails is deceptive. Actually, far out in the tails, 
the curve is dropping proportionately at an ever-increasing rate, In other 
words, when two intervals of equal length are compared, the one closer to the 
center may experience a greater numerical drop, but the one further out in the 
Uuil experiences a greater drop when measured as a proportion of the height 
at the beginning of the interval, 

‘The mean here is the same as the median and is located at the center, We 
‘want a unit of measurement that will apply equally well to any normal 
distribution, and we choose a unit that arises naturally out of the curve’s shape, 
‘There is a point on each side where the slope is steepest. These two points 
are called points of inflection, and the distance from the mean to either point 
is precisely equal to one standard deviation. Thus, it is convenient to measure 
distances under the normal curve in terms of 2scores (recall from Chapter 3 
that scores are fractions or multiples of standard deviations from the mean). 


point ot inecton point of itaction 


PAV AVAN 


Normal curves with the same mean Normal curves with the same standard de- 
but different standard deviotions viation but different means 
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Mathematically, the formula for the normal curve tums out to be 
y= €* where yis the relative height above a z-score. (By relative height we 
‘mean proportion of the height above the mean.) However, our interest is not 
so much in relative heights under the normal curve as in proportionate areas, 
‘The probability associated with any interval under the normal curve is equal 
to the proportionate area found under the curve and above the interval, 

Table A in the Appendix gives proportionate areas under the normal curve. 
Because of the curve's symmetry, it is sufficient to give areas from the mean 
to positive z values, Table A shows, for example, that between the mean and 
a =score of 1.2 there is 3849 of the area: 


° 12 score 


while 4834 of the area lies between the mean and a >score of —2.13, 


213 ° score 


Brenpe 101 The life expectancy of a panicular brand of light bulbs is normally 
distributed with a mean of 1500 hours and a standard deviation 
of 75 hours. 


12751350 1425—=«1500= 157516501725 Hours 
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a, What is the probability that a bulb will last between 1500 and 


1650 hours? 
1500 1650 Hours 
eH 
° 2 score 


Answer: The =score of 1500 is 0, the =score of 1650 is (1650- 
1500/75 = 2, and 2 in Table A gives a probability of 472, 


b, What percentage of the light bulbs will last between 1485 and 


1500 hours? 
1485 1500 Hours 
a ey 
020 score 


Answer: The =score of 1485 is (1485-1500)/75 = ~0.2, and 0.2 
in Table A gives a probability of .0793 or 7.93%. 


©, What is the probability that a bulb will last between 1416 and 
1677 hours? 


Answer: The =score of 1416 is (1416 — 1500)/75 = —1.12, and 
the score of 1677 is (1677 — 1500)/75 = 2.36, In Table A, 1.12 
gives a probability of 3686, and 2.36 gives a probability of .4909. 
The total probability is 3686 + 4909 = 8595 


d. What is the probability that a light bulb will last between 1563 


and 1648 hours? 
15631848 Hours 


ost 197 tcorw 


Answer: The =score of 1563 is (1563 ~ 1500)/75 = 0.84, and of 
1648 is (1648 — 1500)/75 = 1.97. In Table A, 0.84 and 1.97 give 
probabilities of 2995 and 4756, respectively. Thus, between 1563 
and 1648 there is a probability of 4756 ~ .2995 = .1761. 


€. What is the probability that a light bull will last less than 1410 
hours? 


g 
i 


Answer: The >score of 1410 is (1410 ~1500)/75 = =1.2. In Table 
A, 1.2 gives a probability of 3849. The probability of being less 
than 1500 is .5, and so the probability of being less than 1410 is 
5 3849 = 1151. 


A packing machine is set to fill a cardboard box with a mean 
‘average of 16.1 ounces of cereal. Suppose the amounts per box 
form a normal distribution with standard deviation equal to 0.04 


‘ounce. 
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3 2 a 1 2 


a, What percent of the boxes will end up with at least 1 pound 


of cereal? 
16 161 Ounces 
ee 
25 ° 


Answer: The =score of 16 is (16 ~ 16.1)/0.04 = ~2.5, and 2.5 in 
Table A gives a probability of 4938. Thus, the probability of more 
than 1 pound is 4938 + .5 = .9938 or 99.38%. 


bb. Ten percent of the boxes will contain more than what number 
of ounces? 


Answer: In Table A, we note that 4 area (actually, 397) is found 
between the mean and a =score of 1.28, 50 to the right of a 1.28 
=score must be 10% of the area. Converting the =score of 1.28 
into a raw score yields 16.1 + 1.28(0.04) = 16.151 ounces. 


¢. Eighty percent of the boxes will contain more than what 
number of ounces? 
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ss 0 


16.086 16.1 Ounces: 
‘score 


Answer: In Table A, we note that 3 area (actually, 2995) is found 
between the mean and a >score of 0.84, s0 to the right of a ~0.84 
score must be 80% of the area. Converting the z-score of ~0.84 
into a raw score yields 16.1 — 0.84(0.04) = 16,066 ounces. 


d. The middle 90% of the boxes will be between what two 
weights? 


18 “eacore 


Answer: In Table A, we note that 4495 area is found between the 
mean and a =score of 1.64, while a 4505 area is found between 
the mean and a >score of 1.65, so use the score of 1.645. Then 
90% of the area is between zscores of —1,645 and 1.645. Con- 
verting =scores to raw scores yields 16.1 ~ 1.645(0.04) = 16.034 
ounces, and 16.1 + 1.645(0.04) = 16.166 ounces, for the two 
‘weights between which the middle 90% of the boxes will be. 


COMMONLY USED PROBABILITIES 
AND Z-SCORES 


As can be seen from Example 10.2d, there is often an interest in the limits 
enclosing some specified middle percentage of the data. For future reference, 
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the most frequently asked for of these limits are noted below in terms of 
zscores. 


Since 45 + 45 = .90, then 90% of the values are between 2scores of 
=1.645 and +1.645. Since 475 + 475 = .95, then 95% of the values are 
between zscores of =1.96 and +1.96. Since 495 + 495 = .99, then 99% of 
the values are between zscores of ~2.58 and +2.58, 

Sometimes the interest is in values with particular percentile rankings. For 
example: 


Ninety percent of the values are below a 2score of 1.28, ninety-five percent 
of the values are below a z-score of 1.645, and ninety-nine percent of the 
values are below a z-score of 2.33. 

‘There are corresponding conclusions for negative zscores: 


Ninety percent of the values are above a 2score of —1.28, ninety-five 
percent of the values are above a z-score of ~1.645, and ninety-nine percent 
of the values are above a zscore of ~2.33. 

Its also useful to note the percentages corresponding to values falling 
between integer scores. For example: 


Since 3414 + .3413 = 6826, then 68.26% of the values are between zscores 
of Land +1. Since 4772 + .4772 = .9544, then 95.44% of the values are 
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between zscores of —2 and +2. Since .4987 + 4987 = .974, then 99.74% of 
the values are between =scores of —3 and +3. 


Gromple 10.3 Suppose that the average height of adult males in a particular 
———— locality is 70 inches with a standard deviation of 2.5 inches. 


a. If the distribution is normal, the middle 95% of males are 
between what two heights? 


Answer: As noted above, the critical zscores in this case are 
£1.96, so the two limiting heights are 1.96 standard deviations 
from the mean. Therefore, 70 * 1.96(2.5) = 70 + 4.9, or from 65.1 
to 74,9 inches. 

b, Ninety percent of the heights are below what value? 


Answer: The critical score is 1.28, so the value in question is 70 
+ 1.28025) = 70 + 3.2 = 73.2 inches. 


¢. Ninety-nine percent of the heights are above what value? 


Answer: The critical score is ~2.53, 50 the value in question is, 
70 ~ 2382.5) = 70 ~ 5.825 = 64.175 inches. 


d. What percentage of the heights are between scores of +11? Of 
+2 Of £3 


Answer: 68.26%, 95.44%, and 99.74%, respectively. 


FINDING MEANS AND STANDARD 
DEVIATIONS 


If we know that a distribution is normal, we can calculate the mean 4 and 
the standard deviation o, using percentage information from the population 


Example 104 Given a normal distribution with a mean of 25, what 
standard deviation if 189% of the values are above 29? 


the 


100 Probability Distributions 


Hl 


Eee Raw score 
+ + 
0 092 score 


Answer: Looking for a .32 probability in Table A, we note that 
the corresponding z-score is 0.92. Thus 29 ~ 25 = 4 is equal to 
a standard deviation of 0.92, that is, 0.920 = 4, and a = 4/092 = 
438, 


Given a normal distribution with a standard deviation of 10, what 
is the mean if 21% of the values are below 50? 


Answer: Looking for a .29 probability in Table A leads us to a 
z-score of ~0.81. Thus 50 is -0.81 standard deviation from the 
mean, so p= 50 + 081(10) = 58.1 


Given a normal distribution with 80% of the values above 125 and 
90% of the values above 110, what are the mean and the standard 
deviation? 


Answer: Table A gives critical zscores of —0.84 and ~1.28. Thus 
we have (125 ~ w/o = —0.84 and (110 ~ w/o = ~1.28. Solving 
the system [125 — p = ~0.840, 110 — » = ~1.280} simulane- 
ously gives p= 153.64 and & = 34.09, 
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NORMAL APPROXIMATION TO THE 
BINOMIAL 


Many pructical applications of the binomial involve examples where 1 is 
large. However, for large n, binomial probabilities can be quite tedious to 
calculate, Since the normal can be viewed as.a limiting case of the binomial, itis 
natural to use the normal to approximate the binomial in appropriate situations. 
‘The binomial takes values only at integers, while the normal is continuous 
with probabilities corresponding to areas over intervals, Therefore, we must 
establish some technique for converting from one distribution to the other, For 
approximation purposes we do as follows. Each binomial probability will 
correspond to the normal probability over a unit interval centered at the desired 
value. Thus, for example, to approximate the binomial probability of eight 
successes we determine the normal probability of being between 7.5 and 8.5, 


Trample 107 Suppose that 15% of the cars coming out of an assembly plant 
have some defect. In a delivery of 40 cars what is the probability 
that exactly five cars have defects? 


Answer: The actual answer is 325 (.15)%.85)™, but clearly this 


involves a nontrivial calculation. To approximate the answer using 
the normal, we first calculate the mean wand the standard 
deviation «as follows 


B= np = 400.15) = 


= Ynp= p) = V9001385) = 2258 


45 55 Cars with detects 


086 -022 score 


We then calculate the appropriate zscores: (4.5 — 6)/2.258 = 
0.66 and (5.5 — 6)/2.258 = -0.22. Looking up the correspond- 
ing probabilities in Table A, we obtain a final answer of .2454 — 
(0871 = .1583. (The actual answer is .1692.) 


Even more useful are approximations relating to probabilities over intervals. 
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Erempe 108 1f 60% of the population suppor massive federal budget cuts, 
what is the probability that in a survey of 250 people at most 155 
people support such cuts? 


Answer: The actual answer is the sum of 156 binomial expres- 
sions: 


250! 
155195) 


However a good approximation can be obtained quickly and 
easily by using the normal. We calculate @ and o: 


= mp = 25006) = 150 


@ = np — p) = V25K6KA) = 7.746 


(A $+ SI COCA 


150. 1555 
c ——+ 
0 on ‘score 


‘The binomial of at most 155 successes corresponds to the normal 
probability of $155.5. The z-score of 155.5 is (155.5 — 150)/7.746 
= 0.71. Using Table A leads us to a final answer of 5000 + .2611 
= 7611 


Is the normal a “good” approximation? The answer, of course, depends on 
the error tolerances in particular situations. A general rule of thumb is that the 
normal is a good approximation to the binomial whenever both np and n 
(1 — p) are greater than 5. 
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Bample 10.9 A particular form of cancer is fatal within 1 year in 30% of all 
diagnosed cases. A new drug is tried on 200 patients with this 
disease. Researchers will judge the new medication effective if at 
least 150 of the patients survive for longer than 1 year. If the 
medication has no effect, what is the probability that at least 150 
patients survive for longer than 1 year? 


Answer: The actual answer is the sum of 51 binomial expressions: 


200! yy0¢ 3y0 4 20! 
150150! © 7 151 


THB ++ CM 


which would be very tedious to calculate, However, an approxi- 
mate answer using the normal is readily calculated. 


= np = 200.7) = 140 


@ = np — p) = Y20007KD = 648 
140 1495 
+ 
° 147 score 


‘The binomial probability of 150 successes corresponds to the 
normal probability on the interval from 149.5 to 150.5, and thus 
the binomial probability of at least 150 successes corresponds to 
the normal probability of =149,5. The score of 149.5 is (149.5 
~ 140)/6.48 = 1.47. Using Table A gives us a final answer of 5000 
— 4292 = .0708. 


In some situations both the normal and the Poisson are reasonable approxi- 
mations to the binomial. 


Exenple 10.10 Since airline companies know that 4% of all reservations received 
will be no-shows, they overbook accordingly. Suppose there are 
126 seats on a plane, and the airline books 130 reservations, What 
is the probability that more than 126 confirmed passengers will 
show up? In other words, what is the probability that the number 
of no-shows will be three or less? 
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Answer: Since both np = 130(.04) = 5.2 and m1 — p) = 130(.96) 
= 124.8 are greater than 5, it is reasonable to determine a normal 
approximation. Also, both p = .04 = .05 and n = 130 = 20 so it 
is reasonable to calculate a Poisson approximation, 


35 52 No-shows 
rr! 
07m 0 score 


‘The normal approximation is calculated using = 130 and p = 
04 to derive p = 5.2 and @ = 2.2343. The zscore of 3,5 is 
(3.5 — 5.2)/2.2343 = —0.76, and the probability approximation is 
5 ~.2764 = .2236, 


‘The Poisson approximation to the binomial is calculated as fol- 
lows: 


(5295 (2y 


et + SM 8 + Ode + eS = 2381 
‘The actual answer is the sum of four binomial probabilities: 
130! 130! 
Ot 0498. 96)127 28 
TaraT ODN IONE + ET 048.969 
+130(.04).960" + (9691 = 2323 


EXERCISES 
For exercises 1-10 assume that the distribution is normal, 

1, A trucking firm determines that its fleet of trucks averages a mean of 12.4 
miles per gallon with a standard deviation of 1.2 mpg on cross-country 
hauls, What is the probability that one of the trucks averages more than 
13 mpg? Fewer than 10 mpg? 

2. A factory dumps an average of 2.43 tons of pollutants into a river every 
week. If the standard deviation is 0.88 tons, what is the probability that in 
a week more than 3 tons are dumped? More than 1 ton? Fewer than 2 tons? 
Fewer than 5 tons? 


10. 


u. 


2 


13. 
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An electronic product takes an average of 3.4 hours to move through an 
assembly line. If the standard deviation is 0.5 hours, what is the probability 
that an item takes between 3 and 4 hours? Between 2 and 3 hours? 
Between 4 and 4.4 hours? 


. The mean score on a college placement exam is 500 with a standard 


deviation of 100. What percentage of the students score above 700? 
Ninety-five percent score above what? Fifty percent score below what? 


. The averige noise level in a restaurant is 30 decibels with a standard 


deviation of 4 decibels. Ninety-nine percent of the time the noise level is 
above what value? Ninety-five percent of the time it is below what value? 


. The mean income per household in a certain state is $9500 with a standard. 


deviation of $1750. Ten percent of the household incomes are below what 
value? The middle ninety-five percent of the incomes are between what 
two values? The middle ninety-eight percent are between what two levels? 


Jay Olshansky from the University of Chicago was quoted in Chance News 
‘as arguing that for the average life expectancy to reach 100, 189% of people 
‘would have to live to 120. What standard deviation must he be assuming 
for this statement to make sense? 


Cucumbers grown on a certain farm have weights with a standard devia- 
tion of 2 ounces, What is the mean weight if 85% of the cucumbers weigh 
less than 16 ounces? 


. If 75% of all families spend more than $75 weekly for food, while 15% 


spend more than $150, what is the mean weekly expenditure and what is 
the standard deviation? 


A coffee machine can be adjusted to deliver any fixed number of ounces 
of coffee. If the machine has a standard deviation in delivery equal to 0.4 
‘ounce, what should be the mean setting so that an 8-ounce cup will 
overflow only 0.5% of the time? 


‘The probability that a certain brand of painkiller will cure a simple 
headache is .81, If the medication is given to 29 individuals with simple 
headaches, what is the probability that fewer than 20 are cured? That 
‘exactly 24 are cured? That at least 24 are cured? Solve using the normal as 
an approximation to the binomial. 


Assume that a baseball team has an “average” pitcher, that is, one whose 
probability of winning any decision is .5. If this pitcher has 30 decisions 
in a season, what is the probability he wins at least 20 games? Solve using, 
the normal as an approximation to the binomial. 


Five percent of all contract deliberations with a particular union result in 
strikes. In 50 such deliberations, what is the probability of exactly two 
strikes? Exactly three strikes? Compare the binomial solution with the 
Poisson and normal approximations. 
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Study Questions 
1. Earlier, the formula for the normal curve was given as y= e~*'?, where y 

isa relative height corresponding to the proportion of the value above the 

mean. 

a. Explain the significance that z appears as a squared term (What does 
this say about symmetry?) 

'b. Explain the significance of the minus sign (What happens to y as = 
becomes very large?) 

If we substitute Cx ~ w/e for zand give actual rather than relative values 

for 9; the formula becomes 


etx wifant 


1 

vow 

In this form, symmetry is around what value for a¥ 

d. What happens as x becomes very lange? 

€. What happens to the value of y when x = pif @ is chosen to be larger, 
and what does this indicate about the graph? (You might also try (© 
reason what the effect of the 0? term is on the graph if g is chosen to 
be larger) 

You should be reaching a conclusion about how the value of o controls 

the relative extent to which the bell-shaped graph is peaked or spread out, 


2. The normal is an example of a continuous probability distribution. More 
generally (for students with a little elementary calculus), if a function fl) 
is nonnegative on its domain, and if the definite integral of /€) over its 
domain is equal to 1, then f(x) is called a probability density function. 
Verify that each of the following is a density function over the domain 
indicated: 


) foo = 22 over 0s x52 @ fo) =howrisxss 


@ fod = 3} wert sxe OW) fo) = 38 wr 0s x <e 


3. If 2x) is.a probability density function for a variable x, then the probability 
» 
that x falls between two values @ and b is f fajdx Suppose that a 


telephone company has gathered data conceming the duration of calls 

lasting no more than 60 minutes, and has determined that fx) = 

(60 ~ 39/1800 over 0 = x = 60 (minutes) is the appropriate density 

function. 

(1) Find the probability that a randomly selected call lasts 3 minutes or 
less. 

(2) What proportion of calls last from 15 to 45 minutes? 

(3) What is the probability that a call will last more than 45 minutes? 


Part 4 


THE POPULATION MEAN 


The point of view of statistics is illustrated by the following 
example. Suppose a jar is filed with black and white balls. If we 
kenew the exact number of balls of each color, we could apply 
probability theory to predict the likelibood of drawing a ball of a 
particular color or of drawing a sample of a particular mixture. On 
the other hand, suppose we do not know the composition of the 
whole jar, but we bave drawn a sample. Statistics tells us with what 
confidence we can estimate the composition of the whole jar from 
that of the sample. 

In our daily lives we frequently make judgments based on the 
characteristics of samples. A consumer looks at a few price tags and 
concludes that a clothing shop is a bigh- or low-priced store. A bigh 
school senior talks with a group of students and then forms an 
opinion about the college these students attend. A traveling sales- 
person rings doorbells for a day and then estimates the profitability 
ofa new territory. 

In all these examples complete observations are impossible or at 
least impracticable. The same is true in more structured situations, 
A medical researcher can't test everyone who has a particular form 
of cancer, a retailer can't test every flashbulb from a shipment 
(there would be none left to sell), and a pollster can't survey every 
potential voter. 
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Tous we use samples to draw inferences about characteristics of 
the whole population. Many questions arise. With what size and in 
what manner should a sample be chosen? What conclusions about 
the population can be drawn from the sample? With what degree 
of confidence can these conclusions be stated? In answering these 
questions, we keep in mind tbat we are usually considering only a 
few members out of a large population and thus can never make 
any inference about the whole population with 100% certainty. We 
can, however, draw inferences with specified degrees of certainty. 

In Chapters 11-15 we learn bow samples are used both to 
establish confidence intervals of population means and to analyze 
hypothesis tests about population means. We consider single popu- 
lations and the comparison of two or more populations. The effect 
of using large or small samples is also taken account of. 


11 


CONFIDENCE INTERVAL 
OF THE MEAN 


Heights, weights, and like measurements tend to result in normal distribu- 
tions, but normally distributed natural phenomena occur much less frequently 
than might be supposed. However, the normal curve has an importance in 
statistics that is independent of whether or not it appears in nature, What is 
significant is that the results of many types of sampling experiments can be 
analyzed using the normal curve. For example, there is no reason to suppose 
that the amounts of money that different people spend in grocery stores are 
normally distributed. However, if every day we survey 30 people leaving a 
store and determine the average grocery bill, these daily averages will have a 
nearly normal distribution, This statistical key will enable us to use a sample 
‘mean to estimate a population mean. 


DISTRIBUTION OF SAMPLE MEANS 


We are interested in estimating the mean of a population, For our estimate 
we could simply randomly pick a single clement of the population, but 
we then would have litte confidence in our answer. Suppose instead we 
pick 100 elements and calculate their average. It is intuitively clear that the 
resulting sample mean has a greater chance of being closer to the mean 
of the whole population than does the value for any individual member 
of the population. 

When we pick a sample and measure its mean, we are finding exactly one 
sample mean out of a whole universe of sample means. To judge the 
significance of a single sample mean, we must know how sample means vary. 
Consider the set of means from all possible samples of a specified size. It is 
both apparent and reasonable that the sample means will be clustered about 
the mean of the whole population; furthermore, these sample means will have 
a tighter clustering than do the elements of the original population. In fact, we 
might guess that the larger the chosen sample size, the tighter will be the 
clustering. 

How do we calculate the variance of the set of sample means? Suppose the 
variance of the population is ofop, and we are interested in samples of size 7. 
Sample means are obtained by first summing together » elements and then 
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dividing by 7. In Chapter 2 we leamed that a set of sums has a variance equal 
to the sum of the variances associated with the original sets. In our case 


2g = Oly H.+o2, = N07 
Fume = Pop Fee = "pop 


Also from Chapter 2 we remember that when each element of a set is 
divided by some constant, the new variance is the old one divided by the 
square of the constant. In our case the means are obtained by dividing the 
sums by m, so the variance of the sample means is obtained by dividing the 
variance of the sums by 1. Thus if o symbolizes the variance of the sample 
means, we find that: 


2 NO} _ OF 
o? = Simm — hp _ Spe 
ee ” ” 


We note that the variance of the set of sample means varies directly as the 
variance of the original population and inversely as the size of the samples, In 
terms of standard deviations, we have 


CENTRAL LIMIT THEOREM 


With the above result in hand we state the following principle which forms 
the basis of much of what we do in this chapter and following ones. It is a 
simplified statement of the central limit theorem of statistics. 

curt with a population with a given mean 4 and standard deviation @. Pick 
nn sufficiently large (at least 30), and take all samples of size n. Compute the 
mean of each of these samples. Then: 


1. The set of all sample means will be approximately normaly distributed 

2. The mean of the set of sample means will equal 1, the mean of the 
population. 

3. The standard deviation, 6, of the set of sample mean will be approxi- 
mately equal to o/Vn, that is, equal to the standard deviation of the 
whole population divided by the square root of the sample size. 


Suppose that the average outstanding credit card balance for 
young couples is $650 with a standard deviation of $420. If 100 
couples are selected at random, what is the probability that the 
mean outstanding credit card balance exceeds $700? 


Answer: The sample size is over 30, so by the central limit 
theorem the set of sample means is approximately normally 


distributed with mean 650 and standard deviation 420/100 = 42. 
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With a z-score of (700 — 650)/42 = 1.19, the probability that the 
sample mean exceeds 700 is 5000 — 3830 = 1170. 


CONFIDENCE INTERVAL ESTIMATE OF THE 
POPULATION MEAN 


Using a measurement from a sample, we will never be able to say exactly 
what the population mean is; rather we will always say we have a certain 
confidence that the population mean lies in a particular interval. The central 
limit theorem gives us the probability that a sample mean lies within a specified 
interval around the population mean, but this is precisely the same as saying 
that the population mean lies within a specified interval around the sample 
mean (the distance from Ithaca to Elmira is the same as the distance from 
Elmira to Ithaca). 


Eremple 112 A bottling machine is operating with a standard deviation of 0.12 
‘ounce. Suppose that in a sample of 36 bottles the machine inserted 
an average of 16.1 ounces into each bottle. 


a, Estimate the mean number of ounces in all the bottles this 
machine fills. More specifically, give an interval in which we 
are 95% certain that the mean lies. 


1.96 1.96 Z8core 


Answer: For samples of size 36, the sample means are approxi- 
mately normally distributed with a standard deviation of a; = 
on = 0.12N3% = 0.02 


From Chapter 10 we know that 95% of the sample means should 
be within 1.96 standard deviations of the population mean, 
Equivalently, we are 95% certain that the population mean is 
within 1.96 standard deviations of any sample mean.' In our case, 
16.1 + 1.96(0,02) = 16.1 + 0.0392, and we are 95% sure that the 


We cannot say there is 4.95 probabulitythat the population mean is within 1.96 standard deviations 
(of a given sample mean. For a given sample mean, the population mean either is or is not within 
the specified interval, s0 the probability is either 1 or 0 
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mean number of ounces in all bottles is between 16.0608 and 
16.1392. This is called a 95% confidence interval estimate. 


b. How about a 99% confidence interval estimate? 


258 258 xscorw 


Answer: Here, 16.1 + 2.58(0.02) = 16.1 * 0.0516, and we are 
‘99% sure that the mean number of ounces in all bottles is between 
16.0484 and 16.1516, 


Note that, when we wanted a higher certainty (99% instead of 95%), we had 
to settle for @ larger, less specific interval (+0.0516 instead of +0.0392), 

From Example 11.2 we can give simple formulas for the 95% and 99% 
confidence interval estimates, respectively: 


+1960, = Ft 
In 


F £2580, = F258 
In 


Confidence intervals other than 95% and 99% are obtained by replacing 1.96 
or 2.58 by the appropriate =score. 


73 In a certain plant, batteries are being produced with a life expec- 
tancy that has a variance of 5.76 months squared. Suppose the 
mean life expectancy in a sample of 64 batteries is 12,35 months. 


a. Find a 90% confidence interval estimate of life expectancy for 
all the batteries produced in this plant. 


1845 1645 score 


Answer: The standard deviation of the population is « = V5.76 
= 2.4, and the standard deviation of the sample means is 0; = 


14 
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an = 2464 = 0.3. The 90% confidence interval estimate for 
the population mean is 12.35 + 1.645(0.3) = 12.35 + 0.4935. Thus 
we are 90% certain that the mean life expectancy of the batteries 
is between 11.8565 and 12.8435 months. 


b. What would the 90% confidence interval estimate be if the 
sample mean of 12.35 had come from a sample of 100 batteries? 


Answer: The standard deviation of sample means would then 
have been o; = @/Vn = 2.4/V100 = 0.24, and the 90% confidence 
interval estimate would be 12.35 = 1.645(0.24) = 12.35 + 0.3948, 


Note that when the sample size increased (from 64 10 100), the 
same sample mean resulted in a narrower, more specific interval 
(20,3948 versus +0.4935). 


‘A new drug results in lowering heart rates by varying amounts 
with a standard deviation of 2.49 beats per minute, 


a, Pind a 95% confidence interval estimate for the mean lowering 
of heart beats in all patients if a 50 person sample averages a 
drop of 5.32 beats per minute. 


Answer: The standard deviation of sample means is 


o; = oN = 249/50 = 0.352. 


We are 95% certain that the mean lowering of heart beats is in the 
range 5.32 > 1.96(0.352) = 5.32 + 0.69 or between 4.63 and 6.01 
heartbeats per minute, 


b. With what certainty can we assert that the new drug lowers 
heart rates by a mean of 5.32 + 0.75 beats per minute? 
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457 532 607 


“213 0 213 ‘Pscore 


Answer: Converting 0.75 to a score yields 0.75/0.352 = 2.13, 
For a =score of 2.13 Table A gives 4834, and our answer is 
2.4834) = 9668. In other words, 5.32 + 0.75 beats per minute is 
‘4 96.68% confidence interval estimate of the mean lowering of 
heart rate effected by this drug, 


SUBSTITUTION OF s WHEN o IS UNKNOWN 


Frequently we do not know @, the population standard deviation, In such 
. We must use 5, the standard deviation of the sample, as an estimate for 
Sana 
. However, if we compute s as 2a 
samples of size n tend to give underestimations for ¢. Fortunately, ther 


im 


correction factor, "> 


n_ [Sx—3¥ _ [Ex 
” ” 
‘we will always calculate the standard deviation of a sample by using the 
formula 


cas 


, the resulting s values from 


Noting that 


whenever we intend to use $s an estimate for @. For large sample size n, the 
correction becomes quite small. As there was for o, there is an arithmetical tool 


for calculating s 
ax 

,- Poa PS 

nat =i 
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An advertiser wishes to determine the mean number of hours per 
week that teenagers spend before television sets, The results of 
500 interviews are as follows; Ex = 16,475 and E(x — 3 = 
48,907. Determine a 98% confidence interval estimate. 


Answer: We first calculate the sample mean and standard devia- 
tion: 


233 233 escore 


‘We use 5 = 9.9 as an estimate for the population standard 
deviation @ to find the standard deviation of the sample means: 
0; = an = 9.9/1500 = 0.443. We can be 98% sure that the 
mean number of hours per week that teenagers spend before 
television sets is in the range 32.95 + 2.330.443) = 32.95 + 1.03, 
‘or between 31.92 and 33.98 hours. 


A survey was conducted on 250 out of 125,000 families living in 
a city. The average amount of income tax paid per family in the 
‘sample was $3540 with a standard deviation of $1150. Establish a 
‘99% confidence interval estimate for the total taxes paid by all the 
families in the city. 
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Answer: We are given X = 3540 and s = 1150. We use sas an 
estimate for @ and calculate the standard deviation of the sample 
‘means: 


03 = on = 1150/N250 = 72.73 


A.99% confidence interval estimate for the mean tax paid by each 
family is 3540  2.58(72.73) = 3540 + 187.64. A 99% confidence 
interval estimate for the total taxes paid by all the families is 
125,000(3540 + 187.64) = 442,500,000 * 23,455,000 or between 
$419,045,000 and $465,955,000. 


SELECTING A SAMPLE SIZE 


Statistical principles are useful not only in analyzing data, but also in setting 
up experiments. One consideration involves the choice of a sample size. In this 
chapter we have made interval estimates of population means and have seen 
that each inference must go hand in hand with an associated confidence level 
statement. Generally, if we want a smaller, more precise interval estimate, we 
cither decrease the degree of confidence or increase the sample size. Similarly, 
if we want to increase the degree of confidence, we may either accept a wider 
interval estimate or increase the sample size. Thus, choosing a larger sample 
size seems always desirable; in the real world, however, time and cost 
considerations are involved. 


Trompe 117 Ball bearings are manufactured by a process that results in a 

——— standard deviation in diameter of 0,025 inch. What sample size 
should be chosen if we wish to be 99% sure of knowing the 
diameter to within £0.01 inch? 


258 258 2score 
Answer: We have 
o; = Nn = 0.025Nn 


‘and 2.580%; < 0.01. Thus 2.58 (0.025/¥n) < 0.01. Algebraically, we 
find that Yn = 2.58(0.025)/0.01 = 6.45, so n = 41.6. We choose 
a sample size of 42 
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Example 119 
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A formula for the above calculation is 


(oe) 


which in this problem gives m = ((2.58X0.025)/0.01F = 41.6, 


” 


A government investigator plans to test for the mean quantity of 
particular pollutant that a manufacturer is dumping per day into 
a river, She needs an estimate that is within 50 grams at the 95% 
confidence level. If previous measurements indicate that the vari- 
ance is approximately 21,800 grams?, how many days should she 
include in the sample? 


Answer: We have o? = 21,800, so ¢ = 21,800 = 147.65. Then 
oy = o/Nn = 147.65/Nn and 1.9605 = 50. Thus 1.96(147.65/Vn) 
S50, Vin = 1.96(147,65)/50 = 5.788, and n = 33.5, The investi- 
gator should sample 34 days’ dumping, 


A sociologist is designing an experiment to determine the mean 
‘age of USS, citizens who have strong opinions against committing 
funds for military operations in Bosnia, She has determined that, 
for a 90% confidence estimate of the mean age to within +3.0 
years, she will need to survey 100 individuals having the specified 
opinions, What would be the sample size for a 90% confidence 
‘estimate of the mean age to within 21.5 years? To within 1.0 
year? 


1.96 1.98 2ecore 


Answers: We have 0;=Aio and 16450; = 30, or 
1.645(°A100) = 3.0. We want to know m and nm where 
1.645("Aim) = 1.5 and 1.645(°A%) = 1.0. Algebraically, we find that 
Vin, = ' 594 5, and Yn, = }59% 0, and from the known sample 
size we have 1.6450 = 3.0V100. Thus, Vn, = 3.0V100/..5, so m 
= 400. Also, Vig = 3.0V100.0, so nz = 900. 
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‘Two points are worth noting. First, what would have been the 
result if, for example, 95% had been used instead of 90% for the 
confidence level? 


Answer: Fach 1.645 would have been replaced by 1.96, and 
instead of dividing 1.645 by 1.645, we would divide 1.96 by 1.96. 
The resulting answers, 400 and 900, however, would not have 
changed. 


Second, how much must the sample size be increased in order to 
cout the interval estimate in half? To a third? 


Answer: To cut the interval estimate in half (from +3.0 to 1.5), 
‘we would have to increase the sample size fourfold (from 100 to 
400). To cut the interval estimate to a third (from +3.0 to +1.0), 
we would have to increase the sample size ninefold (from 100 t0 
900). 


More generally, if we want to divide the interval estimate by d without 
affecting the confidence level, we must increase the sample size by a multiple 
of d* 


EXERCISES 


1. In a test for acid rain, 49 water samples showed a mean pH level of 4.4 
with a standard deviation of 0.35. Find a 90% confidence interval estimate 
for the mean pH level. 

2, One gallon of gasoline is put into each of 30 test autos, and the resulting 
mileage figures tabulated with X = 28,5 and s = 1.2, Determine a 95% 
confidence interval estimate of the mean mileage. 

3. For a sample of 490 men judged low in fitness, the resting heart rate 
(beats/min) had a mean of 64 with a standard deviation of 10; the 
triglyceride count (mg/dl) showed a mean of 126 with a standard deviation 
of 68; and the vital capacity (mL) had a mean of 4029 with a standard 
deviation of 860. The comparable figures among a sample of 487 men 
judged high in fitness were x = 59 and s = 9 for heart rate, X = 101 and 
5 = 48 for triglycerides, and = 4827 and s = 867 for vital capacity (New 
England Journal of Medicine, February 25, 1993, page 535.) 

a. Find 90% confidence interval estimates for the mean heart rates of the 
men judged low and judged high in fitness. 

b. Find 95% confidence interval estimates for the triglyceride means of 
both samples. 

¢. Find 99% confidence interval estimates for the vital capacity means of 
both samples, 
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4, The number of accidents per day in a lange factory is noted for each of 64 
days with 3 = 3.58 and s = 1.52. With what degree of confidence can we 
assert that the mean number of accidents per day in the factory is between 
3.20 and 3.962 


5, A company owns 335 trucks. For a sample of 30 of these trucks, the 
average yearly road tax paid is $9540 with a standard deviation of $1205. 
‘What would be a 99% confidence interval estimate for the total yearly road 
taxes paid for the 335 trucks? 


6, What sample size should be chosen to find the mean number of absences 
per month by school children to within +.2 at a 95% confidence level if it 
is known that the standard deviation is 1.1? 


7. Hospital administrators wish to learn the average length of stay of all 
surgical patients. A statistician determines that, for a 95% confidence level 
estimate of the average length of stay to within £0.5 days, 50 surgical 
patients’ records will have to be examined. How many records should be 
looked at for a 95% confidence level estimate to within £0.25 days? +1 
day? 

8. The 1995 profits (in millions of dollars) for a sample of 35 corporations 
are: 23, 43, 12, 3, 45, 41, 0, 23, 18, 37, ~12, 15, 71, 22, 10, 33, 34, 61, 20, 
=21, 0, 29, 18, 57, 58, 0, 35, 38, 21, —32, ~39, 17, 21, 40, 29. Establish a 
98% confidence interval estimate for the 1995 profits of all corporations. 


9. The ages of 49 company presidents give Ex = 2753.8 and Xx — 9 = 
1348.32. Determine a 90% confidence interval estimate for the mean age 
‘of company presidents, 

10, In a sample of 36 hours of television programming, the number of minutes, 
‘of commercials for each hour is noted and the totals tabulated as Lx = 
378 and Ex? = 4123.35. Determine a 95% confidence interval estimate for 
the mean number of commercial minutes per hour of television program- 
ming. 


Study Questions 
1. Consider a population of size N = 5 consisting of the elements [3, 6, 8, 
10, 181 
a. Determine the population mean 1 and the population standard devia- 
tion @. (For use in Study Question 3, note that the population variance 
o? = 256.) 
. List all possible samples of size = 2 [there are 5,2) = 10 of them, 
and determine the mean ¥ of each sample. 
¢. Show that the mean of the set of ten sample means is equal to the 
Population mean 
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d. Show that the standard deviation of the set of ten sample means is 
equal to 


{You should find that the calculation from the definition and from this 
formula are both equal to 3.0984} * 
. More generally, what can be said about Tn 
size Nis very large? tg 
2. a. For the same set as in Study Question 1, 13, 6, 8, 10, 18), form all 
possible “ordered” samples of size 2. In other words, (3,6) and (6,3) 
will be considered different samples, and duplications such as (3,3) will 
also be allowed. 
b. Determine the mean for each of the 5 X 5 = 25 resulting samples, 
¢. Show that the standard deviation of the set of these 25 sample means 
is precisely equal to o/Vn, where @ is the population standard deviation 
and n = 2 is the sample size. 
3. Consider again the set of 25 “ordered” samples from Study Question 2, 
a. For each compute the variance using # = 3(x ~ ¥?/n. (Note that each 
& is over only two terms, and n = 2.) 
'b, Sum and divide by 25 to find the average of these variances. 
c. Repeat the calculations this time using ¢ = Xx — ¥¥/(n — 1). 
Note that using 1 — 1 instead of 1 results in the average of the sample 
variances being 25.6 which is precisely the variance @ of the original 
population (3, 6, 8, 10, 181. This example is true more generally and 
indicates why we define the sample standard deviation by 


4, In this chapter we noted the effect on sample size if the interval estimate 
is changed without changing the confidence level. What happens if we 
want to change the confidence level without changing the interval esti- 
mate? For example, for a given interval estimate, what is the effect on 
sample size if we go from a 90% confidence level to a 95% confidence 
Jevel? From 90% to 9996 
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HYPOTHESIS TEST 
OF THE MEAN 


Closely related to the problem of estimating a population mean isthe problem 
of testing a hypothesis about a population mean. For example, a consumer 
protection agency might determine an interval estimate for the mean nicotine 
content of a particular brand of cigarettes, or, alternatively, it might test a 
manufacturer's claim about the mean nicotine content of his cigarettes. An 
agricultural researcher could find an interval estimate for the mean productivity 
gain caused by a specific fenilizer, or alternatively, she might test the developer's 
claimed mean productivity gain. A social scientist might ascertain an interval 
estimate for the mean income level of migrant farmers, or alternatively, he might 
test a farm bureau's claim about the mean income level. In each of these cases, 
the experimenter must decide whether the interest lies in an interval estimate of 
a population mean or in a hypothesis test of a claimed mean, 


NULL AND ALTERNATIVE HYPOTHESES 


‘The general testing procedure is to choose a specific hypothesis to be tested, 
called the mull bypotbesis, pick an appropriate sample, and then use measure- 
ments from the sample to determine the likelihood of the null hypothesis. Just 
as in Chapter 11, conclusions are stated, not with absolute certainty, but rather 
with associated significance levels 

‘There are two types of possible errors we will consider: the error of 
mistakenly rejecting a true null hypothesis and the error of mistakenly failing 
to reject a false null hypothesis. For example, a manufacturer claims that the 
mean lifetime of an electronic component of his product is 1500 hours. A 
researcher believes that the true figure is lower and will test the 1500-hour 
claim by measuring the lifetime of each element in a sampling of components. 
‘The researcher decides that, ifthe sample average is less than 1450 hours, she 
will reject the manufacturer's claim. Alternatively, if the sample average ex- 
ceeds 1450 hours, she will conclude that she does not have sufficient evidence 
to reject the 1500-hour claim. 

‘The claim to be tested, the mull hypothesis, denoted as H,, is usually what 
‘we want to challenge, and is stated in terms of a specific value for a population 
parameter. In the case considered here, 


Hs w= 1500 
121 
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The alternative bypothests, denoted as H,, is usually what we want 10 
establish, and is stated in terms of an inequality such as <, >, or #, In this 
case 


Ha: we < 1500 


If the akernative hypothesis involves the inequality < or >, there is one 
critical value ¢ that separates the null hypothesis rejection region from the fail 
10 reject region. In this case 


= 1450 ang Mies sion |_ fal 0 reject region 


If the 1500-hour claim is true but the sample mean happens to be less than 
1450, the researcher will mistakenly reject the null hypothesis, This is called a 
Type Terror, and the probability of committing such an error is called the a-risk, 
If the 1500-hour claim is false, but the sample mean happens to be over 1450, 
the researcher will mistakenly fail to reject the null hypothesis. This is called a 
Type If error, and the probability of committing such an error is called the 
B-risk. (The Greek leters used to designate Type I and Type II errors are 
lower-case alpha and beta, respectively.) 


Null hypothesis (4 = 1500) 


Tre False 
less Type | enor 
than Mistakenly reject Seeract decision 
sample 450 | manufacturers claim eject fase claim 
mean 
Type Wevor 
More ‘Creact decison: ‘Mistakenty fail to 
than Do not roet : 
1450 ‘rue clabe reject manufacturer's 
false claim 


Several questions immediately come to mind, How should the critical value 
¢ be chosen? How can the probabilities of Type I and Type Il errors be 
calculated? What effect does sample size have on the whole procedure? These 
and other queries will be answered in the discussion and examples of this 


chapter. 


CRITICAL VALUES AND a-RISKS 


For a given critical value, such as that in the example above, we can sketch 
the distribution of sample means around the claimed population mean, and 
then note the region corresponding to the probability of a Type I error or 
actisk. 
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Reject Ho 


Treaph 121 A coffee dispensing machine is supposed to drop 8 ounces of 
liquid into each paper cup, but a consumer believes that the actual 
amount is less, As a test he plans to obtain a sample of 36 cups 
of the dispensed liquid, and, if the mean content is less than 7.75 
‘ounces, to reject the 8-ounce claim. If the machine operates with 
a standard deviation of 0.9 ounces, what is the a-risk? 


Answer: We have 


8 Ounces 


‘The standard deviation of sample means is a; = oi = 
0.9/N36 = 0.15. The score for 7.75 is (7.75 — 8)/0.15 = ~1.67. 
Using Table A, we obtain a = 5000 — 4525 = .0475. Thus, if the 
S-ounce claim is correct, there is a .0475 probability that the 
consumer will stil obtain a simple mean less than 7.75 and will 
mistakenly reject the claim. 


‘A patient claims that he is consuming only 2000 calories per day, 
but a dietician suspects that the actual figure is larger. The dietician 
plans to check food intake on 30 days and will reject the patient's 
claim if the 30-day mean is more than 2100 calories. Ifthe standard 
deviation (in calories per day) is 350, what is the probability of a 
Type | error? 
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Answer: We have: 


He w = 2000 
Ha: w > 2000 
Fado reject H, 
eject Hy 
2000 2100 Calories 
° 156 ecore 


‘The standard deviation of sample means is a; = a/Vn = 
350/V30 = 63.9. The =score for 2100 is (2100 ~ 2000)/63.9 = 
1.56, and from Table A we obtain .5000 ~ 4406 = .0594, Thus, if 
the 2000-calorie claim is correct, there is a .0594 probability that 
the dietician will still obtain a mean greater than 2100 and 
mistakenly reject the patient's claim. 


Example 123 A government statistician claims that the mean monthly rainfall 
—~— along the Liberian coast is 15.0 inches with a standard deviation 
‘of 120 inches. A meteorologist plans to test this claim with 
measurements over 3.5 years (42 months). If she finds a sample 
mean more than 2.0 inches different from the claimed 15.0 inches, 
she will reject the government statistician’s claim. What is the 
probability that the meteorologist will mistakenly reject a true 


claim? 


Answer: We have: 
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This is an example of a two-sided (or two-tailed) test, and the 
acrisk is computed by adding the probabilities corresponding to 
two The standard deviation of the sample means is 6 = 
120N42 = 185. The zscore for 17 is (7 ~ 15)/1.85 = 1.08; 
similarly, the zscore for 13 is -1.08. From Table A we obtain 
5000 = 3599 = .1401, for a total probability of Type 1 error of 
1401 + 1401 = 2802. 


Examples 12.1-12.3 begin with critical values and proceed to find the 
corresponding a-tisks, In most situations, however, this reasoning is reversed 
that is, we choose acceptable a-risks and then calculate the corresponding 
ctitical values, The choice of a-risk is called the significance level of the test. 


Ereuple 124 A manufacturer claims that a particular automobile model will get 

———_ 50 miles per gallon on the highway. The researchers at a con- 
sumer-oriented magazine believe that this claim is high and plan 
a test with a sample of 30 cars. Assuming the standard deviation 
between cars is 2.3 miles per gallon, determine the critical value 
for a test at the 5% significance level. 


Answer: We have: 


493 50 Mpg. 
+ 


1.845 0 score 


‘The standard deviation of sample means is 03 = 2.3/V30 = 0.42. 
From Table A, the critical z-score corresponding to a = .05 is 
1.645, which translates 10 a critical raw score of 50 ~ 1.645(0.42) 
= 49.3. Thus, if the sample mean is less than 49.3 miles per gallon, 
the manufacturer's claim should be rejected. 
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Trample 125 A student handbook claims that college students study an average 

——~— of 30 hours per week. A guidance counselor plans to test this 
claim at the 196 significance level. What are the critical values if 
the standard deviation in the number of hours that students study 
per week is 8 hours? Assume a sample of 100 students. 


Answer: We have: 
Hs w= 30 
Hew * 30 
a= 01 
0; = 8/100 = 08 
Fallto reject He 


Reject H, | BS Reject H, 
27.94 30 32.06 Hours 


—— 
258 ° 258 score 


This is a two-sided (two-tailed) test, so for a total a-tisk of ,01 we 
put a probability of Y2 (01) = .005 on each side. A probability of 
005 gives critical zscores of +2.58. The corresponding critical 
hour values are 30 + 2.58(0.8) = 30 + 2.06. Thus the handbook 
claim of 30 hours should be rejected if the sample mean is less 
than 27.94 or greater than 32.06 hours. 


DRAWING CONCLUSIONS; p-VALUES 


Examples 12.1-1255 illustrate the relationship between critical values and 
actisks. We now review the relevant terminology, and then give more complete 
examples of hypothesis testing 

The nud! hypothesis is a claim about a population that we usually hope to 
dispute, while the alternative bypothests is an opposing claim that we hope to 
establish, In each of the above examples the null hypothesis is in the form of 
an equality statement about the population mean (such as. = 25.1), while 
the alternative hypothesis is stated as an inequality (such as p< 25.1 or p> 
25.1 or p ¥ 25.1). 

We aitempt to show that the null hypothesis is unacceptable by showing 
that it is improbable. Our testing procedure involves picking a sample and 
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comparing the sample statistic (such as the mean X) to the claimed population 
parameter (such as q). Critical values are chosen to gauge the significance of 
‘a sample statistic. If the sample statistic is far enough away from the claimed. 
population parameter, we say that there is sufficient evidence to reject the null 
hypothesis. The probability of committing a Type / error, that is the a-risk, is 
the probability of mistakenly rejecting a true null hypothesis. Later in this 
chapter we discuss more fully the Type / error, that of mistakenly failing to 
reject a false null hypothesis. 

In working with critical values and a-risks, we use the central limit theorem, 
that is that the set of all sample means is normally distributed around the 
population mean with a standard deviation equal to the population standard 
deviation divided by the square root of the sample size, In practice, the 
population standard deviation is usually unknown, and we use the sample 
standard deviation as an estimate for it 

‘The following examples illustrate the principles stated above and introduce 
the concept of p-values. 


27 An automotive company executive claims that a mean of 48.3 cars 
per dealership are being sold each month. A major stockholder 
believes this claim is high and runs a test by sampling 30 dealer- 
ships. 

a. What conclusion is reached if the sample mean is 45.4 cars with 
a standard deviation of 15.4? Assume a 10% significance level. 


Answer: 


He: w= 48.3 
Hew < 483 
a=.10 


Fallto reject H, 


Reject Hy 


Here 
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cars is 483 — 1.28281) = 44,7. Since 45.4 > 44,7, the stockholder 
should not reject the executive's claim. 


b. If @ had been chosen differently, would the executive's claim 
have been rejected? 


Answer: The sample statistic, 454, has a zscore of 
(45.4—48.3)/2.81 = —1.03. Using Table A, we note that if « was 
5000 — 3485 = .1515 Cor larger), the stockholder would have 
sufficient evidence to reject the 48.3-car claim. 


With Example 12.7 as motivation, we define the p-value of a test to be the 
smallest value of a for which the null hypothesis would be rejected, Or, 
equivalently, if the null hypothesis is assumed to be true, the p-value of a 
sample statistic is the probability of obtaining a result as extreme as the one 
obtained. Note that the smaller the p-value, the more significant is the differ- 
ence between the null hypothesis and the sample results. In Example 12.7, the 
value is .1515. Therefore, although the null hypothesis could not be rejected 
at the 10% (or even the 15%) significance level, it could be rejected at, for 
example, the 16% significance level. 


Example 128 A manufacturer claims that a new brand of air-conditioning unit 
———"_ uses only 6.5 kilowatts of electricity per day. A consumer agency 
believes the tue figure is higher and runs a test on a sample of 
size 50. 
a. If the sample mean is 7.0 kilowatts with a standard deviation 
of 1.4, should the manufacturer's claim be rejected at a signifi- 
cance level of 5%2 OF 1962 


Answer: 
He w= 65 
He w> 65 
a= 05 
Fal to reject H, 
Reject H, 
es 2 Kowal 
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Here 
a. gfe uid 3 

Tn Tn 0 

With a = .05, the critical z-score is 1.645. There are actually two 
ways to proceed. We can convert 1.645 to a critical number of 
kilowatts and check whether 7.0 is greater than this critical number, 
or we can convert 7.0 to a z-score and compare this value 10 the 
critical zscore, 1.645. By the first method, 1.645 converts to a raw 
score of 6.5 + 1,645(0.198) = 6.83; 7.0 > 6.83, so reject Hy, By the 
second method, 7.0 converts to a zscore of (7.0 = 6.5/0.198 = 
2.53; 2.53 > 1.645, so reject H,,. Thus, at the 5% significance level, 
the consumer agency should reject the manufacturer's claim. 


o; 


Fait torject H, 
Reject H, 
65 696 Klowatts 
0 233 eco 


With a = .01, the critical zscore is 2.33 with a corresponding 
critical number of kilowatts: 6.5 + 230.198) = 6:96, Since 7.0 
> 6.96, the consumer should still reject the manufacturer's claim 
at the 196 significance level. Note that we could have compared 
2.53, the z-score of 7.0, with 2.33 to reach the same conclusion. 


'b. What is the p-value of this test result 


Answer: We have shown that the difference between the sample 
Statistic ( = 7.0) and the claimed population parameter (q. = 6.5) 
is significant at both the .05 and .01 levels. Calculation of the 
pyalue gives a more complete picture of the significance of the 
observed difference. 
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Example 12.9 


‘The zscore of 7.0 is 2.53; using Table A, we find the p-value 
to be 5000 ~ 4943 = .0057. Note that with this calculation we 
could have quickly concluded that at the 05 and .01 significance 
levels there is sufficient evidence (0 reject H,. We also see that 
there would have been sufficient evidence at the .006 level, but 
not at the .005 level. 


A local chamber of commerce claims that the mean family income 
level is $12,250. An economist nuns a hypothesis test, using a 
sample of 135 families, and finds a mean of $11,500 with a 
standard deviation of $3180. Should the $12,250 claim be rejected 
at a 5% level of significance? 


Answer: Note that this isa two-sided test; that is, the economist 
suspects that the claim may be incorrect, but does not know 
whether it is probably too high or too low. 
He pe = 12,250 
Hew # 12,250 
a= 05 


Fait to reject H, 


Reject Hy Roject H, 


For a total @ = .05, we place a probability of .5(.05) = .025 on 
each side, and note the critical zscores of +1.96, These translate 
into critical incomes of 12,250 + 1.96(273.7) or 11,713.45 and 
12,786.45. Since 11,500 < 11,713.45, the chamber of commerce 
claim is rejected. 

Alternatively, we could have changed 11,500 into a z-score: 
(11,500 ~ 12,250)/273.7 = —2.74 and observed that ~2.74 is less 
than the critical z-score, 1.96. 
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‘We could have also reasoned in terms of the p-value: For the 
zscore, -2.74, Table A gives a probability of 5000 — 4969 = 
0031. Since this is a two-tailed test, where disagreement with the 
null hypothesis can be in either of two directions, the p-value is 
twice the .0031 probability, or 0062. Since ,0062 < .05, there is 
sufficient evidence to reject H, 


Example 12.10 A pharmaceutical company claims that a medication will produce 

> a desired effect for a mean time of $8.4 minutes. A government 
researcher runs a hypothesis test of 250 patients and tabulates the 
following data with reference to the durations of effect in minutes: 
‘Ex = 14,875 and Sx—X)} = 17,155, Should the company’s claim 
be rejected at a significance level of 10%? Of 2%? 


Answer: 


Ex _ M075 «555 


250 
Sas pss 
nat apr =o 


2.2 185 
o; == = 0525 
a In 250 


o= 10 on 02 


5754 En 
eS r7 1S Beco 235 r] 233 


For @ = 10, the critical values (in minutes) are 58.4 + 
1,645(0.525), or 57.54 and 59.26. For a = .02, the critical values 
(in minutes) are 38.4 + 2.340.525), or 57.18 and 59.62. Now 59.5 
> 59.26, but 59.5 < 59.62, so the researcher would reject the 
company’s claim at the 10% significance level, but not at the 2% 
significance level. In other words, if she is willing to be wrong 10 
times out of 100, she will reject the company’s claim; but if she is 
willing to accept only 2 errors in 100 decisions, she will conclude 
that the evidence is not strong enough to reject the claim. 

Again, the p-value can be used to measure specifically the 
strength of evidence for rejection of H,. In this example, the 
-zscore for 59,5 is (59.5 ~ 58.4)/0.525 = 2.10. Using Table A, we 
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find the corresponding probability to be 5000 - .4821 = .0179. 
Doubling because the test is two-sided results in a p-value of 
2.0179) = 0358. 


TYPE Il ERRORS AND B-RISKS 


‘The preceding discussion and examples centered around critical values and 
corresponding Type I errors. The question arises, why not always choose the 
actisk to be extremely small, such as .001 or .0001, and so virtually eliminate 
the possibility of mistakenly rejecting a correct null hypothesis? The problem 
is that this course of action would simultaneously increase the chance of never 
rejecting the null hypothesis, even if it were false. 

Thus we are led to a discussion of the Type II error, that is, a mistaken 
failure to reject a false null hypothesis. As defined earlier, the probability of a 
Type I error is called the B-risk. We shall see that there is a different value of 
B for each possible correct value for the population parameter, 


Nall hypothesis, 
Tre False 
Accept Correct ‘Type I 
Me decision error 
Decision to 
Reject Type Correct 
4% error decision 


Erengle 1211 City planners are trying to decide among various parking:plan 

~~ options ranging from more on-street spaces to multilevel facilities 
to spread-out small lots. Before making a decision, they wish to 
test the downtown merchants’ claim that shoppers park for an 
average of only 47 minutes in the downtown area. The planners 
have decided to tabulate parking durations for 225 shoppers and 
to reject the merchants’ claim if the sample mean exceeds 50 
minutes. What is the probability of a Type II error if the true value 
is 48 minutes? If the true value is 51 minutes? Assume that the 
standard deviation in parking durations is 27 minutes. 


Answer: We have 
Hy w= 47 
Hg p> a7 


= ofdn = 27/4225 = 18 


Hypothesis Test of he Mean 133 


Fao reject H, 


Reject A 
a 2° Mires 


If the true mean parking duration is 48 minutes, the normal 
curve should be centered at 48. In this case, the z-score for 50 is 
(50 — 48/18 = 1.11. Using Table A, we calculate the B-risk 
(probability of failure to reject H,) to be .5000 + .3665 = 8665. 


Fait reject H, 


If the true mean parking duration is 51 minutes, then the 
normal curve should be centered at 51. The critical value is still 
50, with a =score now of (50 ~ 51)/1.8 = —0.56. We use Table 
Ato find the Brisk to be 5000 ~ .2123 = .2877. 


Fait reject Hy Reject H 

so 51 Minutes 
—EEEE 

06 0 ‘score 


‘Note that the further the true value is (in the suspected direc- 
tion) from the claimed mean, the smaller the probability is of 
failing to reject the false claim. 


In many situations we start with a significance level, use this to calculate the 
critical value, and then find particular B-risks. 
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Example 12.12 A geologist claims that a particular rock formation will yield a 

mean amount of 24 pounds of a chemical per ton of excavation, 

His company, fearful that the true amount will be less, plans to 

fun a test on a sample of 50 tons. 

a. If the standard deviation between tons is 5.8 pounds, what is 
the critical value? Assume a 1% significance level. 


Answer: We have 
Hy wad 
Hy ws 
a=01 
o _ 58 
= Fe = 082 
vn” 50 
Fal to reject Hy 
Reject My 
22.09 2 Pounds 
SS 
233 ° 2 6c0%0 


With @ = 01, the critical z-score is -2.33, and the critical 
poundage value is 24 ~ 2.33(0.82) = 22.09. Thus, the decision is 
to reject Hy if the sample mean is less than 22.09, and to fail to 
reject if it is more than 22.09. 


'b, What is the probability of a Type II error if the true mean is 22? 


Is 20? 
Reet 
Fat tort, 
2 709 Pounds 
oon ‘secore 


Answer: If the true mean is 22 pounds per ton of rock, then the 
zscore for 22.09 is (22.09 — 22)/0.82 = 0.11, and so the B-risk is 
5000 — .0438 = .4562. 
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Reject, 
Fait rojoct H, 
Ey 2209 Pounds 
° 255 Reco 


If the true mean is 20 pounds per ton of rock, then the score 
for 22.09 is (22.09 ~ 20)/0.82 = 2.55, and so the Brrisk is .5000 
= 4946 = 0054, 


2.13 A medical research team claims that high vitamin C intake in- 
teases endurance. In particular, 1000 mg of vitamin C per day for 
a month should add an average of 43 minutes to the length of 
maximum physical effort that can be tolerated, Army training 
officers believe the claim is exaggerated and plan a test on 400 
soldiers. 

a. If the standard deviation of added minutes is 3.2, find the 

critical values for a significance level of 5% and then of 1%. 


Answer: We have: 


Hy we 43 
Hy p< A3 
32/1900 = 0:16 
a= 05 aso 
Fail to Faitto 
Reject Hy reject Hy Reject Hy reject Hy 
400743 Minutes 3.927 43 
—1645—~=C«O ‘mecore 233 0 


Corresponding to a z-score of —1.645 is 4.3 ~ 1.645(0.16) = 4.037; 
corresponding to a z-score of —2.33 is 4.3 — 2330.16) = 3.927. 


b. In each case, what is the probability of a Type II error if the 
true mean increase is only 4.2 minutes? 


Answer: 
Fail to reject H, Fail to reject H, 
eject H, Reject Hy 
4037 42 Minutes 392742 
“1020 seco 171 O 


With a mean increase of 4.2, the score for 4.037 is 
(4.037~-4.2)/0.16 = 1,02, and the Brrisk is 5000 + 3461 = 
8461. With a mean increase of 4.2, the z-score for 3.927 is (3.927 
~ 4,2)/0.16 = =1.71, and the B-risk is 5000 + 4594 = 9564, 

Note that decreasing the a-risk (from .05 to .01) led to an 
increase in the B-risk (from .8438 to 9564). This is always the case 
if the sample size is held constant. 


F214 A factory manager claims that the plant's smokestacks spew forth 
‘only 350 pounds of pollution per day. A government investigator 
suspects that the true value is higher and plans a hypothesis test 
with a critical value of 375 pounds. Suppose the standard devia- 
tion in daily pollution poundage is 150. 

a, What is the a-risk for a sample size of 100 days? Of 200 days? 


Answer: We have: 


n= 100 n= 200 
150 150 
= 150 a = 106 
Too = ° * roo 
ramet 
rama 
as 
cto 
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With 0 = 15.0, the 2score for 375 is (375 — 3500/15.0 = 1.67, 
and the corresponding a-risk from Table A is .5000 ~ .452: 
0475. With 0; = 10.6, the score for 375 is (375 ~ 3500/10. 
2.36, and the corresponding a-risk is 5000 — .4909 = .0091 


b. What are the associated B-risks if the tie mean is 385 pounds? 


Roject Ho 


Failto 
roject Ho Failto 


378 385 Pounds 378 385 


er 6 ‘Bacore 8 


With n = 100 and a mean of 385, the z-score for 375 is (375 — 
385)/15.0 = -.67 with a corresponding B-risk of .5000 ~ .2486 = 
2514. With n = 200 and a mean of 385, the z-score for 375 is (375 
— 385)/10.6 = —.94 with a corresponding B-risk of .5000 ~ 3264 
= 1736. 

Note that in this example increasing the sample size (from 100 
to 200) resulted in decreases in both the a-risk (from 0475 to 
0091) and the B-risk (from .2514 to .1736), More generally, if « is 
held fixed and the sample size is increased, the B-risk will 
decrease. Furthermore, by increasing the simple size and adjust- 
ing the critical value, it is always possible to decrease both the 
a-risk and the Brisk. 


EXERCISES 


1, The label of a package of cords claims that the breaking strength of a cord 
is 3.5 pounds, but a hardware store owner believes the real value is less. 
She plans to test 36 such cords; if their mean breaking strength is less than 
3.25 pounds, she will reject the claim on the label. Ifthe standard deviation 
in breaking strengths of all such cords is 0.9 pounds, what is the prob- 
ability of rejecting a true claim? 

2. A union spokesman states that plumbers charge an average of $25 per 
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hour, but a consumers’ bureau doubts that the pay rate is this low. A test 
is planned in which 40 plumbers will be asked to submit estimates on a 
job, and from these estimates a mean hourly rate will be calculated. 
Assuming a standard deviation among rates of $4.50, determine the critical 
value for a test at the 5% significance level. 


3. A city spokesperson claims that the mean response time before a fire truck 
arrives at any fire is 12 minutes. A newspaper reporter suspects that the 
response time is actually longer and runs a test by examining the records 
of 64 fire emergency situations. What conclusion is reached if the sample 
mean is 13.1 minutes with a standard deviation of 6 minutes? Assume a 
286 significance level 


4, A service station advertises that its mechanics will change a muffler in only 
15 minutes. A consumers’ group doubts this claim and runs a hypothesis 
test using 49 cars needing new mufflers. In this sample the mean changing 
time was 16.25 minutes with a standard deviation of 3.5 minutes. 

a. What is the p-value? 
b, What is the conclusion at a level of significance of 1094? 590? 19% 


5. 


Itis believed that using a new fertilizer will result in a yield of 1.6 tons per 

acre, A botunist carries out a two-tailed test on a field of 64 acres. 

a. Determine the p-value if the mean yield per acre in the simple is 1.72 
tons with a stindard deviation of 0.4 

b. What is the conclusion at a level of significance of 10%? 5%? 19@ 


6. A factory manager claims that the concentration of carbon monoxide in 
the air in the plant is only 3 parts per million. A government inspector 
believes the true figure is higher and gathers test data on 36 different clays, 
a. If the resulting tabulations give Ex = 109.08 and E(x ~ x)? = 504, what 

is the p-value? 
b. Can the manager's claim be rejected at a significance level of 1%? 2012 
SMe 


7. The agent of a secretarial pool claims that their average speed is 85 words 
per minute, but this is disputed by the personnel manager of a company 
Which is considering using the pool. A testis set up with a simple of 30 
typists from the pool, and their words per minute figures tabulate as 3x = 
2370 and Ex? = 197,287. 

a. What is the pyvalue? 
b. Should the agent's claim be rejected at a level of significance of 10%? 
He 


8. An author of a new book claims that anyone following his suggested diet 
program will lose an average of 28 pounds per week. A researcher 
believes that the trie figure will be lower and plans a test involving 36 
‘overweight people. She will reject the author's claim if the mean weight 
Joss in the volunteer group is less than 2.5 pounds per week. What is the 
probability of a Type Ul error if the true value is 2.6? If the true value is 
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2.47 Assume that the standard deviation among individuals is 1,2 pounds 
per week. 

9. A nursery owner predicts that a certain group of pine trees will grow an 
average of 25 centimeters per year, but a potential buyer is worried that 
the true rate will be lower. A test is set up with 100 trees, 

a. If the standard deviation between trees is 8 centimeters, what is the 
critical value? Assume a 196 significance level. 
b. What is the B-risk if the true mean growth rate is 24 cm? 22 em? 20 cm? 

10, A manufacturer claims that a motor will not draw more than 1.5 amperes 
under normal use conditions. A factory engineer suspects that this claim is 
low and plans to test 36 such motors. Suppose the standard deviation 
among motors is 0.9 amperes. If the true mean is 1.8 amperes, what is the 
probability of a Type II error associated with each of the following levels 
Of significance: 10%, 5%, and 1%? 

11. The following data from the National Bureau of Standards give daily 
radiation levels (in British thermal units) at 80 random North American 
locations: 1151, 236, 1107, 338, 848, 13, 142, 587, 555, 519, 506, 1106, 498, 
96, 467, 651, 486, 599, 953, 426, 504, 332, 1248, 872, 66, 936, 713, 1037, 
573, 848, 735, 524, 744, 890, 892, 526, 1149, 434, 899, 786, 1035, 590, 713, 
704, 931, 912, 565, 119, 435, 1292, 1066, 590, 566, 540, 604, 938, 539, 1127, 
566, 688, 1000, 633, 622, 1045, 984, 489, 603, 488, 592, 283, 252, 446, 502, 
764, 1224, 451, 1172, 583, 632, and 488. The practicality of using roof- 
mounted solar collectors for home heating depends on the level of daily 
solar radiation available. Suppose that a company producing Mat plate 
collectors needs an average daily radiation per square foot of horizontal 
surface of at least 750 Btu 
a. What is the p-value? 

b, What conclusion should be reached at the 9% significance level? At the 
1% significance leveP 


Study Questions 

1, What is involved in calculating the probability of a Type Il error in a 
twortailed test? For example, suppose that the averige age of persons 
chosen for jury duty is believed to be 37. To test this, a court clerk keeps 
track of the ages of 100 jurors, notes a standard deviation of 12 years, and 
tests the mean at the 5% level of significance. What is the B-risk if the true 
mean age of jurors is 35? 

2. We have seen that different possible values for the mean lead to different 
Brrisks. If we plot » along the 2vaxis and corresponding values of along 
the )+axis, the resulting graph is called the operating characteristic curve. 
Plot such a curve for Examples 12.11 and 12.12. 


13 
DIFFERENCES IN 
POPULATION MEANS 


Many real-life applications of statistics involve comparisons of two popula- 
tions. For example, is the average weight of laboratory rabbits receiving a 
special diet greater than that of rabbits on a standard die? Which of two 
accounting firms pays a higher mean starting salary? Is the life expectancy of 
coal miner less than that of a school teacher? Some examples entail 
relationships between three or more populations; one technique for analyzing 
such cases will be considered in Chapter 15. In this chapter we concentrate on 
‘comparing two populations—more specifically, on comparing the means of 
the two populations. 

First we consider how to compare the means of samples, one from each 
population. When is a difference between two such sample means significant? 
‘The answer is more apparent when we realize that what we are looking at 
is one difference from a set of differences. That is, there is the set of all 
possible differences obtained by subtracting simple means from one set from 
sample means of a second set. To judge the significance of one particular 
difference we must first determine how the differences vary among them- 
selves. The necessary key, discussed in Chapter 2, is the fact that the variance 
of a set of differences is equal to the sum of the variances of the individual 
sets, Thus: 


Now, if 


then 


+22 and 


11 
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CONFIDENCE INTERVAL ESTIMATE 
FOR DIFFERENCE 


In many situations it is clear that the mean of one population is higher than 
that of another, and we would like to estimate the difference. Using samples, 
‘we cannot find this difference exactly, but we will be able to say with a certain 
confidence that the difference lies in a certain interval. To find this confidence 
interval estimate, we follow the same procedure as set forth in Chapter 11 for 
a single mean, this time using wy ~ j12, Vloy/n, + oY/n,), and X, ~ Xin place 
of p, o/Nn, and X, respectively. 


Example 13.1 A 30-month study is conducted to determine the difference in the 
number of accidents per month between two departments in an 
assembly plant. Suppose the first department averages 12.3 acci- 
dents per month with a standard deviation of 3,5, while the 
second averages 7.6 accidents with a standard deviation of 3.4. 
Determine a 95% confidence interval estimate for the difference 
in the number of accidents per month. 


Answer: 
nm = 30 m = 30 
R= 123 %, = 76 
5 =35 s=34 


= (te fs, - [GF GO. ayy 
he ey ny” My 30 3 


‘The observed difference is 123 — 7.6 = 47, and the critical 
-z-scores are £1.96. Thus the confidence interval estimate is 4.7 * 
1,960.89) = 4.7 + 1.74. We are 95% confident that the first 
department has between 2.96 and 6.44 more accidents per month 
than does the second department. 


Trample 132 
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A survey is nun to determine the difference in the cost of groceries 
in suburban stores versus inner-city stores. A preselected group of 
items is purchased in a sample of 45 suburban and 35 inner-city 
stores, and the following data are obtained: 


‘Suburban Stores Inner-City Stores 
m= 45 % = 35 
= $3652 ¥, = $3940 

5, = $110 5, = $123 


Find 2 90% confidence interval estimate for the difference in the 
cost of groceries. 


71.645, 1.645 score 


‘The observed difference is 36.52 — 39.40 = —2.88, the critical 
scores are +1,645, and the confidence interval estimate is ~2.88 
+ 1.645(0.265) = —2.88 + 0.44. Thus we are 90% certain that the 
selected group of items costs between $2.44 and $3.32 Jess in 
suburban stores than in inner-city stores. 


‘A trucking firm conducts a test to compare the life expectancies 
of two brands of tires. Shipments of 1000 and 1500 tires, respec- 
tively, are received and marked, and the truck mileages are noted 
when the tires are replaced. The resulting raw data are as follows: 


Brand F Brand G 
7m, = 1000 my = 1500 
Ex = 22350,000 Ex = 36,187,500 


Xx — FF = 9,600,000,000 E(x — X)? = 15,800,000,000 


Determine a 99% confidence interval estimate for the difference 
in life expectancies. 
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Answer: Calculations of the means and standard deviations yield: 


R= 22350000 - 22.350 pp aia 
fceuaet 
= 3100 pean 32466 
[G100F fess, czar aay 


~ ¥t000- * 


258 2.58 28core 


‘The observed difference is 22,350 ~ 24,125 = ~1,775, and the 
critical =scores are 2.58. Since —1775 + 2.58(129) = —1775 + 
333, the trucking firm can be 99% sure that brand F tires average 
between 1442 and 2108 miles less in life expectancy than brand 
Giires, 


A hardware store owner wishes to determine the difference in 
drying times between two brands of paints. Suppose the standard 
deviation between cans in each population is 2.5 minutes. How 
large a sample (same number) of each must the store owner use 
if he wishes to be 98% sure of knowing the difference to within 
1 minute? 
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With a critical zscore of 2.33, we have 2.333.536/Vn) S 1, so 
Vin = 8.24 and n = 67.9. Thus the owner should test samples of 
68 paint patches from each brand, 


HYPOTHESIS TESTS: a-RISKS 
AND CRITICAL VALUES 
In this situation the null hypothesis is usually that the means of the 
populations are the same, or, equivalently, that their difference is 0: 
He ty ~ Wp = 0 
‘The alternative hypothesis is then: 
He: ty ~ v2 <0, Hy thy = he > 0, OF Hy wy ~ Wa #0 


‘The first two possibilities lead to one-sided (one-tailed) tests; the third possi- 
bility leads to two-sided (two-tailed) tests 


Example 135A dentist believes that toothpaste brand Cis better than toothpaste 

~~ brand P. She asks 30 of her patients to use each brand and records 
the numbers of cavities experienced over a 3-year period, She 
plans to reject any equality claim if the average number of cavities 
for brand C users is at least 1 fewer than the average for brand P 
users. If the standard deviations in cavities per person for each 
brand is 2.3, what is the probability of a Type I error, that is, of 
mistakenly rejecting a correct null hypothesis of equality? 


Answer: 
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i 


136 


i 


e 


‘The score for —1 is (~1—0)/0.594 = ~1.68. From Table A the 
resulting a-risk is 5000 — 4535 = .0465. 


A test is run to determine whether there is a difference in miles 
per gallon between two car models. Forty autos of the first model 
and 50 of the second are available for the test. What is the a-risk 
if the cut-off difference scores are +0.5? Assume standard devia- 
tions of 1.2 and 0.65 mpg, respectively, for the two models, 


Answer: 
Hoty ~ ty = 0 
Hygty ~ty #0 
21 
Fail to roject Ho 
Rojoct Roject Ho 
-05 ° 0.5 Dilerences 
: . ‘ 
237 ° 237 z-score 


‘The zscore for 0.5 is (0.5~-0/0.211 = 2.37. From Table A, each 
tail has a probability of .5000 ~ 4911 = .0089. Adding the two 
tails gives an a-risk of 0089 + .0089 = .0178. 


‘A historian believes that the average height of soldiers in World 
‘War If was greater than that of soldiers in World War 1. She 
‘examines the records of 100 men in each war and notes standard 
deviations of 25 and 23 inches in WWI and WWII, respectively, 
With an a-risk of .05, what is the critical difference in heights? 


Answer: 

Host ~ Hy = 0 
Het ~ ty > 0 
a= 05 
(ae, 3F _ 


Fall to roject Ho 
eject Ho 
0 056 Differences 
+ + 
° 1845 Score 


‘The critical zscore is 1.645, and the critical difference is 
1,645(0.340) = 0,56 inches. 


! 


‘An educator plans to evaluate two different teaching methods by 
comparing the average test scores of two classes of students, each 
taught by a different one of the techniques under study 
the two classes have 35 and 38 students respectively, 
ard deviations in test scores of 24,5 and 21.2 respectively. What 
are the critical differences if the test is to be run with an at-risk 


of OL 

Answer: 

Reject Hy Reject Ho 
139 0 13.9 Dierences 
. * + 
258 0 258 Z6core 


The critical values are =2.58(5.38) = £13.9. Thus, if the average 
test scores show a difference of more than 139, the educator will 
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claim he has sufficient evidence that the two teaching methods 
yield different results. 


HYPOTHESIS TESTS: CONCLUSIONS 
AND p-VALUES 


‘The fact that a sample mean from one population is greater than a sample 
mean from a second population does not automatically justify a similar 
conclusion about the means of the populations themselves. Two points need 
to be stressed. First, the means of samples from the same population vary from 
each other. Second, what we are really comparing are confidence interval 
estimates, not just single points. 


Exonple 139 A sales representative believes that his company's computer has 
more average downtime per week than does a similar computer 
sold by a competitor. Before bringing his concern to his director, 
the sales representative gathers data and runs a hypothesis test. 
He determines that in a sample of 40 week-long periods in 
different firms using his company’s product, the average down- 
time was 125 minutes with a standard deviation of 37 minutes, 
However, 35 week-long periods involving the competitor's com- 
puter yield an average downtime of only 115 minutes with a 
standard deviation of 43 minutes. What conclusion should the 
sales representative draw, assuming a 10% significance level? 


Answer: 
Hoth ~ ba = 0 
Hew ~ ta > 0 


a=10 


Fal to roect Ho 


Reject Ho 
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With a = .10 the critical zscore is 1.28, so the critical difference 
is 0 + 1.28(9.33) = 11.9 minutes. Since the observed difference is 
125 — 115 = 10, and 10 < 11.9, the sales representative does not 
have sufficient evidence that his company’s computer has more 
down time. 


The strength of the observed difference in downtimes can be measured by 
finding the p-value. The z-score of the difference in sample means is 10/9.33 
= 1.07. Using Table A, we find the p-value to be 5000 ~ .3577 = .1423. Thus, 
while the observed difference is not significant at the 10% significance level, it 
‘would be significant at, for example, the 15% level. 


Erenple 13.10 A store manager wishes to determine whether there isa significant 
difference between two trucking firms with regard to the handling 
of egg cartons. In a sample of 200 cartons on one firm's truck, 
there was an average of 0.7 broken eggs per carton with a 
standard deviation of 0.31, while a sample of 300 cartons on the 
second firm's truck showed an average of 0.775 broken eggs per 
carton with a standard deviation of 0.42. Is the difference between 
the averages significant at a significance level of 5%? At a level 


of 19 
Answer: 
00327 
Fall to reject Ho 
Roject Ho eject Ho 
0.084 ° 0.064 Differences 
—-— — + 
196 ° 196 score 


This is a twoxailed test, so, for a total a = .05, we place a 
probability of % (.05) = .025 on each side. The critical z-scores 
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are +1.96, which translate into critical differences of 0 + 
1,96(0.0327) = +0.064. Since the observed difference is 0.7 — 
0.775 = -0.075 and ~0.075 < -0.064, the difference in the 
average numbers of broken eggs per carton is significant at the 


59 significance level. 
Fail to reject Hy 
ject Ho, Roject Ho 
0.084 0 (0.084. Ditlerences 
+ . - 
-258 ° 258 Zscore 


With a = 01, the critical z-scores are +2.58, and so the critical 
differences are 0 + 2.580.0327) = +0.084. Since -0.075 is 
between —0.084 and 0.084, the observed difference is not signifi- 
cant at the 1% significance level. 


A more specific measure of the strength of the observed difference in 
broken eggs per carton is found by calculating the p-value. The observed 
difference of 0.075 has a z-score of —0.075/0.0327 = ~2.29. Using Table A, 
we find that this gives a probability of 5000 ~ 4890 = .0110. The test is 
tworsided, so the p-value is 2.0110) = .0220. 


Example 13.11 A medical researcher believes that taking 1000 mg vitamin C per 

~~ day will result in fewer colds than will a daily intake of 500 mg. 
In a group of 50 volunteers taking 1000 mg/day the numbers of 
colds per individual during a winter season are tabulated leading 
to 


Ex = 90 and Xx — 0 = 73.5. 


Similar tabulations from a group of 60 volunteers taking 500 
mg/day give 


Zy = 144 and Sty — 9 = 944, 


a. If the experiment is run at the 2% significance level, what 
should be the conclusion? 


Reject 
0.488 0 

SS 
-205 0 


For @ = .02 the critical =score is ~2.05, so the critical difference 
is 0 — 2.05(0.238) = —0.488. The observed difference is 1.8 — 2.4 
= -06. Since ~0.6 < ~0.488, the researcher will conclude that 
fewer colds result when people take 1000 mg vitamin C per day 
rather than only 500 mg/day. 


b. What is the p-value? 


Answer: The observed difference of —0.6 has a zscore of 
=0.6/0.238 = —2.52. Using Table A, we find a p-value of 5000 ~ 
4941 = .0059, 


Example 13.12 A realtor believes that the average rent in one location exceeds 
the average rent in a second location by more than the published 
figure of $125. She runs a hypothesis test on 30 rental units in each 
location and obtains averages of $480 and $330, and standard 
deviations of $95 and $80, respectively, for the two locations. 
‘What should the realtor conclude at the 5% significance level? 


152 The Population Mean 
Answer: In this problem the null hypothesis is not that the 
difference is 0. 

Hg: — My = 125 
Hat ~ Wy > 125 


a= 05 


Reject H, 
125 1623 Ditlerences 
+ + 
° 1.645, ecore 


With « = .05 the critical z-score is 1.645, so the critical difference 
is 125 + 1,645(22.68) = 162.3. The observed difference is 480 — 
330 = 150. Since 150 < 162.3, the realtor does not have sufficient 
evidence to claim that the rental difference is more than $125. 

‘There was, however, some observed difference, and its strength 
can be measured by finding the pyalue. The 2score of the 
observed difference of 150 is (150 — 125)/22.68 = 1.10. Using 
Table A, we find the pyalue to be 5000 ~ 3643 = .1357. 


EXERCISES 


1. Ina study Journal of the American Medical Association, June 13, 1990, 
page 3040) aimed at reducing developmental problems in low-bith-weight 
(under 2500 grams) babies, 347 were given a special educational curricu- 
lum while 561 did not receive any special help. After 3 years the children 
who had received the special curriculum showed a mean 1Q of 93.5 with 
a standard deviation of 19.1; the other children had a mean IQ of 84.5 with 
a standard deviation of 19.9. Find a 95% confidence interval estimate for 
the difference in mean IQ's of low-weight babies who receive special 
intervention and those who do not. 

2. Does socioeconomic status relate to age at time of HIV infection? For 274 
high-income HIV-positive individuals the average age of infection was 33.0 
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years with a standard deviation of 6.3, while for 90 low-income individuals 
the average age was 286 years with a standard deviation of 6.3 (The 
Lancet, October 22, 1994, page 1121). Find a 90% confidence interval 
estimate for the difference in ages of high and low income people at the 
time of HIV infection. 


An admissions director conducts a study to compare the scores of male 


and female students on the verbal section of a national standardized exam, 
Test results of 400 women and $00 men are noted to be as follows 


Women: m = 400, Sx = 208,000, 3(x — 3? = 760,000 
Men: n = 500, Ex = 255,000, 3.x ~ 3) = 1,050,000 


Determine a 98% confidence interval estimate for the difference in mean 
verbal test results, 


. An engineer wishes to determine the difference in life expectancies 
between two brands of batteries. Suppose the standard deviation of each 
is 4.5 hours. How large a simple (same number) of each battery should 
be taken if the engineer wishes to be 90% certain of knowing the 
difference in life expectancy to within 1 hour? 


A recent study of health service costs of coronary angioplasty (PTCA) 
versus coronary artery bypass surgery (CABG) at a London hospital (The 
Lancet, October 1, 1994, page 929) showed an average cost of 6176 
pounds with a standard deviation of 329 for 231 angioplasties, and an 
average cost of 8164 pounds with a standard deviation of 264 for 221 
bypass surgeries. Is this sufficient evidence at the 1% significance level to 
say that the average cost of angioplasties is less than the average cost of 
bypass surgeries? 

. Fifty-three children treated for lead poisoning had their 1Q's tested before 

treatment and 6 months later Journal of the American Medical Association, 
April 7, 1993, page 1644). Before treatment these children with unhealthy 
amounts of lead in their blood showed an average 1Q of 83:5 with a 
standard deviation of 10.2; 6 months later their average IQ was 88.1 with 
a standard deviation of 11.2. Is this enough evidence to support the claim 
that average IQ improves after treatment for lead poisoning? Test with a 
= 05, 
‘An educator believes that professors at liberal arts colleges give higher 
grades than their colleagues in professional schools, She obtains a sample 
of 500 grades given at a liberal arts college and tabulates Ex = 1300 and 
Ex? = 3460, and a sample of 350 grades given at a professional school 
and tabulates Ex = 896 and x? = 2360. What conclusion should she draw 
at the 10% significance level? 

3. A study reported in the New York Times (February 16, 1993, page C3) of 
roughly 75 left-handed and 925 right-handed people found the average 
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age of the left-handed people at death to be 66 years while the average 
age of the right-handed people at death was 75 years. What additional item 
of information is necessary to determine whether this is sufficient evidence 
at a 5% significance level to support the theory that right-handed people 
live longer than left-handed people? (This study was heavily criticized 
because it was based on recollections of friends and families and did not 
represent a good cross-section of the population.) 


Study Questions 
1, (A paired difference test) The analysis and procedure developed in this 
chapter require that the two samples being compared be independent of 
‘each other. However, many experiments and tests involve comparing two 
populations for which the data naturally occur in pairs. In this case, the 
proper procedure is to run a one-simple test on the single variable 
consisting of the differences from the paired data 
For example, suppose an efficiency expert wishes to analyze the 
difference in productivity between workers exposed to two different 
lighting arrangements. She chooses a procedure whereby the same 30 
employees work under each arrangement for 1 week in turn, Each 
employee's ourput under the two lighting arrangements, LAI and LA2, are 
summarized as follows. 


Em 1 2.3 4 5 6 7 8 9 0 12 13:14:15 
LAL 15 23 18 19 7 13 2 2 2 19 25 2 15 19 19 
TAZ 12 22 18 20 24 12 2 2 19 17 2% 2 M17 15 
1617 18 19 20 21 22 23 4 25 2% 27 28 2 30 
21 20 3% 24 16 15 2 19 20 17 2 12 10 23 26 
16 22 26 24 19 19 18 19 15 20 2 14 13 20 25 


a. Form the set of thirty differences (be careful with signs). 

b. Runa single sample hypothesis test, Hy: w= 0, Hy: #0, = .050n 
this set to determine whether the differences in lighting arrangements 
are significant. 

. Find the pvalue (you should get .0324). 

2. a, Using the data given in Study Question 1, find the mean and standard 
deviation of each of the two 30-element sets corresponding to LAT and 
LA2 separately. 

b. Run a two-sample hypothesis test, Hy: 1 — Ha = 0, Hy: Hy ~ ba * 0, 
a= 65. 

. Find the pvalue (you should get 3524), 

This seems to indicate that the observed difference is not significant in 

contrast with the analysis and conclusion in Study Question 1. The reason 
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for this apparent inconsistency is that the test used in Study Question 2 is 
not the proper one. This two-sample test should be used only when the 
two sets are independent. In this case, there is a clear relationship between 
the data, in pairs, and this relationship is completely last in the procedure 
for the two-sample test. 


14 


SMALL SAMPLES 


Statisticians and other researchers would save numerous man-hours if small 
samples could be used instead of larger ones. In many situations, such as 
estimating the completion time for a certain model of nuclear reactor or 
determining the damage sustained by luxury cars from front-end collisions, it 
is prohibitively difficult or expensive to gather enough data for a large sample, 
In other situations, such as deciding whether to add or drop a TV series after 
only a few shows have been aired, there is simply not sufficient time to gather 
large quantitiess of data. 


STUDENT t-DISTRIBUTION 


If we have a normal population, then, for m small, the set of means of all 
samples of size 7 is still normally distributed, but the set of variances is not 
‘The modified sample variances, E(x ~ X)?/n ~ 1), do have a mean equal to 
the population variance, 02, but their distribution is skewed with more low 
values than high ones. Thus (% — 4)/(sNin) has more high values (both plus 
and minus) and so does not follow a normal distribution, We can no longer 
say, for example, that 95% of the sample means are within 1.96 (sn) of the 
population mean. More than 5% of the sample means will lie outside these 
critical values. 

If, however, the original population is normally distributed, there is a 
distribution that can be used when working with the sin ratios. This Student 
tedistribution, was introduced in 1908 by W. S. Gosset, a British mathematician 
‘employed by the Guiness Breweries. (IF we are working with small samples 
from a population that is mot nearly normal, we must use very different 
“nonparametric” techniques not discussed in this book. If 1 is large, we use 
the normal distribution tables.) 

Thus, for a small sample from a normally distributed population, we work 
with the variable 


with a resulting ¢-distribution that is bell-shaped and symmetrical, but is lower 
at the mean, higher at the tails, and so more spread out than the normal 
distribution. 
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«Like the binomial distribution, the tdistribution is different for different 
Values of n. In the tables these distinct Fdistributions are associated with the 
values for degrees of freedom (df). For this discussion the dfvalue is equal to 
the sample size minus 1. The smaller the df value, the larger is the dispersion 
in the distribution. The larger the df'value, that is, the larger the sample size, 
the closer the distribution comes to the normal distribution. 

Since there is a separate tdistribution for each df value, fairly complete 
tables would involve many pages; therefore, in Table B of the Appendix we 
give areas and evalues for only the more commonly used percentages or 
probabilities. The last row of Table B is the normal distribution, which is a 
special case of the @distribution taken when 1 is infinite. For practical 
Purposes, the two distributions are very close for any = 30. However, when 
more accuracy is required, the Student édistribution can be used for much 
larger values of m. 

Note that, whereas Table A gives areas under the normal curve from the 
mean to positive zvalues, Table B gives areas to the right of given positive 
Hvalues. For example, suppose the sample size is 20, so df= 20 — 1 = 19. 
‘Then a probability of .05 in the fail comesponds with a tvalue of 1.729, while 
.01 in the tail corresponds to 1 = 2.539. 

Examples 14.1-14.7 involve small samples used for estimating population 
means, hypothesis testing on population means, and comparing two popula- 
tion means. In all of these examples, it is assumed that the original populations 
are normally distributed. Without this stipulation, we could not use the 
distribution, as explained above. 


SMALL SAMPLE ESTIMATES OF 
A POPULATION MEAN 


Example 141 When ten cars of a new model were tested for gas mileage, the 
results showed a mean of 27.2 mpg with a standard deviation of 
1.8 mpg. What is a 95% confidence interval estimate for the gas 
mileage achieved by this mode? 
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2.262 2282 Score 


Answer: Because the sample size is less than 30, we must use the 
‘t-distribution. The standard deviation of the sample means is 


18 
= 18 =o: 
OE Tip 7 0.569. 


‘With 10 — 1 = 9 degrees of freedom, and 2.5% in each tail, the 
appropriate f-scores are 2.262. Thus we can be 95% centain that 
the gas mileage of the new model is in the range 27.2 + 
2.262(0.569) = 27.2 + 1.3, or between 25.9 and 28.5 miles per 
gallon, 


Brample 142 When 25 jars of peanut butter labeled as “18-ounce” are weighed, 
the totals are as follows: Ex = 448.5 and E(x — 0 = 0.41, What 
is the 99% confidence interval estimate for the mean weigh? 


Answer: We first calculate the sample mean and standard devia- 
tion: 


a 


dta24 
99 

| las 

“2797 2797 +Scoro 


‘The standard deviation of the sample means is estimated to be 
~ OB 
os ™ Tom pe = 0.026. 


With 25 — 1 = 24 degrees of freedom, and 0.5% in each tail, the 
appropriate tscores are £2,797. Thus a 99% confidence interval 
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estimate for the mean weight is 17.94 + 2.797(0.026) = 17.94 * 
0.073 ounces. 


Example 143A new process for producing synthetic gems yielded six stones 

—~— _ weighing 0.43, 0.52, 0.46, 0.49, 0.60, and 0.56 carats, respectively, 
in its first run, Find a 90% confidence interval estimate for the 
mean carat weight from this process 


Answer: 
x = EX _ 243 + 052+ 046 + 049 + 060 + 056 
” 6 
= 38 = 051 


(0.08) + (OI? + (05) + (002) + (09) + (0.05) 


5 
= 0.0632 
5 _ 00632 
og ~ fe = OH = 0258 
an 6 
aes 
0 
% F 
2015 2015 tecore 


With df = 6 — 1 = 5, and 5% in each tail, the ¢scores are 2.015. 
‘Thus we can be 90% sure that the new process will yield stones 
weighing 0.51 + 2.015(0.0258) = 0.51 + 0.052, or between 0.458 
and 0.562 carats. 


SMALL SAMPLE HYPOTHESIS TESTS OF A 
SAMPLE MEAN 


Example 144 A cigarette industry spokesperson remarks that current levels of 
tar are no more than 5 milligrams per cigarette. A reporter does a 
quick check on 15 cigarettes representing a cross section of the 
market. 
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Example 145 A local chamber of commerce claims that the mean sale price for 
homes in the city is $90,000. A real estate salesperson notes eight 
recent sales of $75,000, $102,000, $80,000, $85,000, $79,000, 
‘$95,000, $98,000, and $62,000. Should the chamber of commerce’s 
claim be rejected at a significance level of 5%? At a level of 10%? 


Answer: 
Reject Hy Rojoct H, 
90,000 Price, 
a a 
2365 ° 2965 core 


For a = .05, we place a probability of .025 in each tail, and note 
that df = 8 ~ 1 = 7 gives critical tscores of *2.365. Critical sales 
Prices are 90,000 + 2.364717) = 90,000 * 11,156, or $78,844 
and $101,156, Since the observed mean of $84,500 is in this range, 
there is not sufficient evidence to reject the claim at the 5% 
significance level 


Fad to roject H, 


Reject Hy Peject Hy 


1895, ° 1.895 acore 
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For a = .10, we place a probability of .05 in each tail, and note 
critical scores of +1.895. Critical sales prices are 90,000 + 
1.895(4717) = 90,000 + 8939, or $81,061 and $98,939. The 
‘observed mean of $84,500 is also in this range, so even at the 10% 
significance level the claim cannot be rejected 


SMALL SAMPLE DIFFERENCES IN 
POPULATION MEANS 


In this case, we must assume not only that both original populations are 
normally distributed, but also that their variances are equal. Then for 0; (see 
Chapter 13) we get 


or Sx- XR =(1-Dst, — istouse 
Lx —¥,¥ + Dr - HY 


tm —D+™,—D 


for sf, where the first sum is over the first sample and the second sum is over 
the second sumple. This quotient can be rewritten as 


(ny = Dst + On, = Ds} 
m +m, 


and finally we have the complex-looking equation 


ny = Ds Hm = DS [T,1 
mtm-2 m Mm 


‘The degrees of freedom in such a situation are 


yf 


D+(m-D 


Eranple 146 Two varieties of com are being compared as to difference in 
maturation time. Ten plots of the first variety reach maturity in an 
average of 95 days with a standard deviation of 5.3 days, while 
eight plots of the second variety reach maturity in an average of 
74 days with a standard deviation of 4.8 days. Determine a 95% 
confidence interval estimate for the difference in maturation time, 
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m=10 m=8 
R= Fa 74 
H=53 2 = 48 


~ [eO=DG3r + @= GEE TT _ 
Say 1048-2 ia india 


212 212 score 


With df = 10 + 8 ~ 2 = 16, and .025 in each tail, the critical 
Fscores are $2.12, The observed difference was 95 ~ 74 = 21, 
so the confidence interval estimate is 21 + 2.12(2.41) = 21 + 5.1 
Thus, we can be 95% certain that the first variety of com will take 
between 15.9 and 26.1 more days to reach maturity than will the 
second variety. 


A city council member claims that male and female officers wait 
equal times for promotion in the police department. A women's 
spokesperson, however, believes women must wait longer than 
men. If five men waited 8, 7, 10, 5, and 7 years for promotion 
while four women waited 9, 5, 12, and 8 years, what conclusion 
should be drawn at the 10% significance level? 


Answer: 
Hoty ~ Bp = 0 
Hetty ~ hy <0 
a=10 
ayes 
mas Ha StTtOt5t7 _ 3, 


(e- 


TAP + 0 = 74? + 00 ~ 749 + G74? +0 = 74 _ ag 


= 


= 
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mad 3,2 2tStts 


[9 =85F + G— aSy +02 — aS ORE 
27] at 289 
~ (maser FG —neae fT 

344 sq 


85 


Fail to reject H, 
Reject Hy 
° Ditlerences 
° score 


With df = 5 + 4 — 2 = 7, and .10 in the tail, the critical #score 
is —1.,415. The critical difference is ~1.415(1.57) = ~2.22. Since 
the observed difference is only 74 — 8.5 = —1.1, there is not 
sufficient evidence to dispute the council member's claim, 


EXERCISES 


In the following exercises, there is the underlying assumption that all 
original populations bave nearly normal distributions. In Exercises 11-15, 
there is the additional assumption that the two original variances are equal. 


1. In a sleep-laboratory experiment, 16 volunteers sleep an average of 7.4 
hours with a standard deviation of 1.3 hours. What would be a 99% 
confidence interval estimate for the mean number of hours that people 
sleep at night? 

2. A doctor notes the number of hours that 17 surgery patients spend in the 
recovery room after use of a new anesthetic. If Ex = 35.7 and Xx — 3? 
= 0.096, what is an 80% confidence interval estimate for the mean number 
of hours spent in the recovery room by patients using the new anesthetic? 

3. Acute renal graft rejection can happen years after the grafi. In one study 
(The Lancet, December 24, 1994, page 1737) 21 patients showed such 
late-acute rejection when the ages of their grafts (in years) were 9, 2,7, 1, 
4,7, 9, 6, 2, 3, 7, 6,2, 3, 1, 2,3, 1, 1, 2, and 7. Establish a 90% confidence 
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interval estimate for the ages of renal grafts that undergo late-acute 
rejection. 


4, Nine subjects, 87 to 96 years old, were given 8 weeks of progressive-re- 
sistance weight training (Journal of the American Medical Association, June 
13, 1990, page 3032). Strength before and after training for each individual 
was measured as maximum weight in (kilograms) lifted by left knee 
extension: 


Before: 3 35 4 6 7 8 85 125 15 
After: 7 17 19 12 19 22 2% 2 2 


Find 95% confidence intervals estimates for: 
a. Before training strength 
b, After training strength: 
¢. Strength gain 

5. An IRS representative claims that the average deduction for medical care 
is $1250. A taxpayer who believes that the real figure is lower samples 12 
families and comes up with a mean of $934 and a standard deviation of 
$616, What conclusion should the taxpayer reach at a 5% significance 
lever 


6 An auditor remarks that the accounts receivable for a company seem to 
average about $2000. A quick check of 20 accounts gives a mean of $2250 
with a standard deviation of $600. 1s this sufficient evidence to dispute the 
auditor's remark at a 5% significance level? 


7. A magazine article states that teenagers watch 2 hours of television per 
day. A sociologist who believes the true figure is higher interviews 18 
teenagers and calculates Ex = 504 and Ex = 1638. What is the 
conclusion at the 196 significance level? 


8. PCB contamination of a river by a manufacturer is being measured by 
amounts of the pollutant found in fish. A company scientist claims that the 
fish contain only 5 parts per million, but an investigator believes the figure 
is higher. What is the conclusion if six fish are caught and show the 
following amounts of PCB (in pans per million): 68, 5.6, 5.2, 4.7, 6.3, and 
5.4? Test at the 5% significance level. 


9. The 26 contestants in the Pentathlon event of the 1992 Olympics ran the 
200 meter dash in the following times (in seconds): 25.44, 24.39, 25.66, 
23.93, 23.34, 25.01, 24.27, 24.54, 25.44, 24.86, 23.95, 23.31, 24.60, 23.12, 
25.29, 24.40, 25.24, 24.43, 23.70, 25.20, 25.09, 24.48, 26.13, 25.28, 24.18, 
and 23.83 (Journal of the American Statistical Association, September 1994, 
page 1101). At the 1% significance level, test the null hypothesis that the 
mean time for Pentathlon contestants is 25 seconds against the alternative 
hypothesis that the mean time is less than 25 seconds. 


10. 


wL 


2 


13. 


4 


15, 
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The weight of an aspirin tablet is 300 mg according to the bottle label. 
Should an FDA investigator reject the label if she weighs seven tablets and 
obtains weights of 299, 300, 305, 302, 299, 301, and 303 milligrams? Use a 
2.5%6 significance level for this two-tailed test 

Six capsules of drug A took an average of 75 seconds with a standard 
deviation of 14 seconds to dissolve, while the average time for six 
capsules of drug B was 71 seconds with a standard deviation of 1.7 
seconds. Establish a 99% confidence interval estimate for the difference in 
dissolving time between the two brands. 

‘The mean height of eight dwarf apple trees is 8.1 feet with a standard 
deviation of 2.3 feet; for five dwarf cherry trees the mean height is 9.8 feet 
with a standard deviation of 2.8 feet. What is the 95% confidence interval 
estimate of the difference in heights between these two fruit trees? 

A hospital exercise-laboratory technician notes the resting pulse rates of 
five joggers to be 60, 58, 59, 61, and 67, while the resting pulse rites of 
seven nonexercisers are 83, 60, 75, 71, 91, 82, and 84. Establish a 90% 
confidence interval estimate for the difference in pulse rates between 
joggers and nonexercisers. 

A researcher believes a new diet should improve weight gain in laboratory 
mice. If ten control mice on the old diet gain an average of 4 ounces with 
a standard deviation of 0.3 ounces, while the average gain for ten mice on 
the new diet is 4.8 ounces with a standard deviation of 0.2 ounces, is the 
researcher's claim justified at the 5% significance level? 

An employer wishes to compare typing speeds of graduates from two 
different programs of study. Eight graduates of the first course type at 62, 
85, 59, 64, 73, 70, 75, and 72 words per minute, while six graduates of the 
second course have typing speeds of 75, 64, 81, 55, 69, and $8 words per 
minute, Is the observed difference significant with a = .20? 


Study Question 

(A paired difference test) An accounting firm measured the blood pressures 
of ten of its CPAs before and then during the spring 1996 tax season. The 
systolic pressures for the ten individuals, designated as A — J, were as follows: 


Al Bie Ce DUR Pe 1G. Re 
Before: 110 124 98 105 115 120 118 110 123 95 
During: 115 126 97108-15124 «11911312196 


1s there sufficient evidence that blood pressures rise during tax season? 

a. Assuming normal populations with equal variances, try to apply the 
two-population, small-sample hypothesis test developed in this chapter 
on Hy: by — Hz = 0, He by — Hy <0. Show that this test does not 
indicate that the observed difference is significant even at a = .10. 

b. Looking at the data, however, it is apparent that a change in blood 
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pressure occurred. The test used in part a does not apply the know!- 
‘edge of what happened to each individual. In fact, since the before and 
during sets are not independent, but rather are related in pairs, that test 
is not the proper one to use. The appropriate test is a one-population, 
small-sample hypothesis test on the set of differences. Form the set of 
ten differences, being careful of sign. Then test the hypotheses H,: js = 
0, He: & <0, and show that the sample mean is significant at a = .05, 


15 
COMPARING MORE THAN 
TWO MEANS 


We now consider cases in which three or more populations are being 
compared, A sample is taken from each population, and the mean of each 
sample is calculated. When can we say that the observed differences in the 
sample means are significant? When should we say that there is insufficient 
evidence that the actual population means are different? 

‘One method would be to use the distribution to compare the first sample 
‘mean with the second, the second with the third, and so on. It is not obvious, 
however, how to combine these results into a statement about Hy: thy = ba = 
By = 


VARIANCE BETWEEN AND WITHIN SAMPLES 


Another approach is suggested by the following example. Consider these 
samples taken from three different populations: 


1. 10.1, 10.8, 97, 9.3, 10.1) with mean 10 
ML (14.2, 149, 15.3, 15.4, 15.21 with mean 15 
Ml, 1205, 198, 19.9, 20.1, 19.71 with mean 20 


Intuitively, there seems to be sufficient evidence to conclude that the means 
of the three original populations are probably not identical. However, suppose 
that the samples had been as follows: 


1.2, 25, ~14, ~3, 40} with mean 10 
H. (=4, 82, 0, —15, 12) with mean 15 
ML. (42, —30, 87, 0, 11 with mean 20 


Here, there does not at all seem to be sufficient evidence that the means of 
the three original populations are different. Note that in the first example the 
differences between the sample means are large when compared to the 
variations within each sample, whereas in the second example the differences 
between the sample means are very small when compared to the variations 
within the samples. 
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Given that differences observed between sample means are more significant 
when they are large compared to differences within each sample, we consider 
a ratio of the form: 


differences between 
differences within 


‘The larger such a ratio, the more evidence that the original population means 
are different. What should go into a ratio of this kind? The logical choice for 
measuring differences is variance. Therefore, we will define and calculate a 
“variance between” and “variance within,” and then form a ratio of these two 
variances. If this ratio, called the F-ratio, is greater than some critical Fvalue, we 
will conclude that there is sufficient evidence to reject Hy: fy = Ha = Hy = » 


TS1 Following are monthly sales figures (number of houses sold) for 
15 real estate agents who work under three different types of 
contract: 


Straight salary: 4,7,8,5,6 
Straight commission: 7, 2, 4, 5,7 
Salary plus commission: 7, 9, 6, 6, 7 


Calculation of means for the 


anne & 


Are these observed differences significant? Is there sufficient evi- 
dence to reject Hy: Wy = Ha = Hy? 
Answer: The variance within samples, in » is calculated by 
averaging the individual variances: 


gf = ALOE + 7 ~ OF + B~ 6) +5 ~ 6% + 6-H 
t 


1 Ca 
=, 
=B=25 
= (= SP + 2~ 52 +4 - 5 + - 5 + O7- 5 
3-1 
8, 
B= 45 
2 _ (7-78 + 9-78 + 6-7 + 6-4-7 
pte s-1 
6 
=$=15 


tke. 25 +45 + 15 _ ayy 
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The variance between samples, Gfyecq > iS calculated by first 
determining o2 (the variance of the Simple means), and then 
USING Dogeen = M02 (from a; = 0/Vn): 


go? = G2 +6-6) 
bs 3-1 


‘The F-ratio is defined to be 


23 


Featin 


Is this ratio large enough to enable us to reject H.? 

To answer this we must first find a critical Evalue. Like the 
distribution, the distribution is actually a family of distributions. 
Where the tdistributions are associated with single degrees of 
freedom, however, the Fdistributions are associated with pairs of 
degrees of freedom. There is a df value for the numerator, and a 
second df value for the denominator, 


df of the numerator = number of samples = 1 = k= 1 
Af of the denominator = (number of samples)(sample size = 1) 
= kn) 


Separate tables exist for each level of significance under consid- 


In this example, there are 3 — 1 = 2 degrees of freedom for 
the numerator, and 365 - 1) = 12 degrees of freedom for the 
denominator. Using, for example, a 
Evalue of 3.88. Since 1.76 < 3.88, there is not suffic 
to reject H,. 


FORMAL PROCEDURE TO COMPARE 
SEVERAL SAMPLE MEANS 


Just as in using the distribution to compare two sample means, we must 
be able to assume, when using the Fdistribution, that all original populations 
are normal, and that their variances are equal. Then, if we have & populations 
and a sample of size n from each, we test the null hypothesis as follows. 
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1, Calculate the sample means: X;, %, «., K Where 


5-28 
x28 


2. Calculate the sample variances: sf, , ., 9 where 
Lar - FF 


= 2a t 
g nt 


3. Calculate the overall mean: 
tht. +, 
eo z 


4, Calculate the variance of the sample means: 


ot = LBP t GG — we tt Gy we 
ad k-) 


5. Calculate the variance within samples: 


_Stgtuts 


Fane F 
6. Calculate the variance between samples: 
Shame 1} 


7. Calculate the Fratio: See 


8. Find the critical Evalue by using the appropriate Ftable (depending on 
significance level), and looking under dfnunerace = #1 and dficncnnstx 
= Kn D). 

9 If Fratio > Feritical, be aware that there is sufficient evidence to reject 
H,. Otherwise, fail to reject Hy. 


‘The above procedure, called one-factor or one-way or one-dimensional 
analysis of variance, was developed in the 1920s by Ronald Fisher, and the F's 
in Fscore, Fratio, and Ftable are derived from his name. Similar, but more 
involved, methods are available when two factors have effects and are under 
consideration, 

Here is a second example of one-way analysis of variance (abbreviated 
ANOVA). 


Exonple 152 A consumer spokesperson claims that the price of a gallon of milk 
————_ in a major city varies by location. She notes the prices in four 
stores chosen in each of five locations. 


Location 1: 1.98, 2.01, 2.02, 1.99 
Location 2: 2.04, 2.03, 2.04, 2.07 
Location 3: 2.06, 2.08, 2.08, 2.08 
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Location 4: 1.95, 1.97, 1.96, 1.98 
Location 5: 2.00, 1.99, 2.02, 2.01 


Is there sufficient evidence to reject the null hypothesis H,: 
ths = ba = ty ™ ba = Bs at the 1% significance level? 
Answer: The sample means are as follows: 
= = 198+ 201 +202 +199 _ 
Ls 
x, = 204 + 203 + 204 + 207 _ 
fe 7 
=, — 206+ 208 + 208 + 208 
5, ~ 200+ 208 + 208 + 208 
= 15+ 197 +196 + 198 «1965 
3,5 MAIDA ED. y 
‘The sample variances are as follows 


2000 


2045 


2075 


005, 


(198 — 2)? + (201 — 2¥ + (202 — 2% + (1.99 — 2% 
ae 7-1 


= SOE = 00008333, 


(204 — 2045) +... + (207 - 2045 


ae 71 

= 220? = 0003 

gq x 20 = 2075) +...+ (208 ~ 20759 

3 7-1 

= 20003 = 0001 

gg = AIS= 1965)? +... + (1.98 - 196592 

Wee a1 

= S2 = 00001667 
(200 ~ 2005 +... + (201 ~ 2005)? 

s- a-1 

= 200° = ano01667 

The overall mean is 


_ 2000 + 2045 + 2075 + 1965 + 2005, 


7 = 2018 


174 The Population Mean 


‘The variance of sample means is 


(2 ~ 2.018) + (2.045 ~ 2.018+...+(2.005 ~ 2.018)? 
3-1 


= 200728 — aooise 
The variance within samples is 


gig, = 10008383 + 0.0003 + 0.0001 + 00001667 + 00001667 
ren 3 


= 00002133 
‘The variance between samples is 


Shewcen = N92 = KOOOIR2) = 000728 


The Fratio is 
= Thesessn = 000728 
Fe ae ooo2iss ~ 15 
Since Uymerse =~ 1 = 5-1 = 4, Ufscnominaioe = KO — 1) = 


(4 — 1) = 15, and a = .01, the critical Fvalue is Fact ~ 4.90. 
Then, since 34.13 > 4.90, there is sufficient evidence to reject H,. 
‘The consumer spokesperson is justified in claiming that the price 
of a gallon of milk varies by location 


EXERCISES 


1, Lime tree honey (L), fruit tree honey (F), and acacia honey (A) were tested 
for healing properties against each of four bacteria strains (The Lancet, 
March 20, 1993, page 757). Measurements of the effectiveness of the 
honeys on a 0 to 5 scale were as follows: 


L 44,43 F:1,2,2,0 ABA BT 


At a 51% significance level is there sufficient evidence that the three honeys 
do not have the same effectiveness as healing agents? 

2. A double-blind test was run to judge the effectiveness of a new drug (cA2) 
in treating rheumatoid arthritis (The Lancet, October 22, 1994, page 1107). 
Of a group of 75 patients, 25 were given a placebo, 25 were given low 
doses of cA2, and 25 were given high doses of cA2. After 4 weeks of 
treatment, the patients’ swollen joints were measured on a 1 to 60 scale 
with the following results: 
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At a 19% significance level is there sufficient evidence that the three 
treatments produce different results? 

3. An executive authorizes a study to examine the length of time between 
order and service of personal pan pizzas in different franchise units. In 
each of five restaurants, 15 lengths of times (in minutes) are observed, 


Franchise 1 Franchise 2 ‘Franchise 3. Franchise 4 Franchise 5 


3 4 4 3 4 
2s 35 35 35 35 
3 3 4 5 5 
4 45 4 5 5 
35 4 4 35 4 
6 4 3 35 65 
35 35 35 4 4 
4 45 5 3 45 
25 5 4s 4s 5 
3 55 6 35 35 
4 4 35 3 6 
4 35 3 3 45 
35 45 4 as 4 
4 4 35 4 4 
35 as 4 35 35 


Is the executive justified in concluding that different franchises prepare 
‘orders quicker than others? Use a = .01. 


Study Questions 
1, A company efficiency expert who wishes to compare the average product 
assembly times on three different shifts gathers the following data. 


Shift 1: 13, 10, 10, 11, 11 
Shift 2: 13, 14, 17, 12 
Shift 3: 10, 11,7, 9, 8, 9 


where the numbers give observed assembly times (in minutes). Is there 
sufficient evidence to conclude that the average assembly times on the 
three shifts are different? 

Note that the sample sizes are different, so our procedure must be 
modified slightly. As before, we calculate the sample means, 3), 3 , and 
35, and the sample variances, 51, ;, and sy. The overall mean pis found 
by averaging all the sample data together (rather than averaging %, % , 
and 3% as before). To calculate the variance between samples, we use a 


Part 5 


THE POPULATION 
PROPORTION 


Chapters 11-15 dealt with estimates and tests for a population 
mean. Equally important in numerous applications are techniques 
and procedures involving a population proportion. For example, 
what proportion of registered voters support a proposed bond issue? 
What proportion of dentists recommend sugarless gum? What pro- 
portion of defense contracts are canceled because of cost overruns? 
What proportion of new houses sell during the first month on the 
market? Even when more refined measurements are possible, it 
may be sufficient to simply look at a proportion. For example, it 
may be necessary to know, not the mean salary, but rather just 
what proportion of the salaries are above $50,000. It may be 
necessary to know, not the mean beight of basketball players, but 
rater just the proportion of players above 6 feet 5 incbes tall. And 
it may be necessary to know, not the mean number of new policies 
sold each week, but rather just the proportion of weeks for which at 
least ten new policies are sold. 

Just as with means, it is usually impossible or impracticable to 
gather complete information about proportions. Thus we must use 
techniques to make inferences about a population proportion when 
only a sample proportion is available. As we should now expect, 
results will involve confidence intervals and degrees of certainty; 


180 The Population Proportion 


Here, however, we are interested in the proportion, rather than the number 
of successes. From Chapter 2 we remember that when we multiply or divide 
every element by a constant, we multiply or divide both the mean and the 
standard deviation by the same constant. In this case, to change number of 
successes to proportion of successes, we must divide by m: 


an nm —m) _ |a(l— a) 
wpe Tew ant o, = SHED _ RID 


Furthermore, if each element in an approximately normal distribution is 
divided by the same constant, it is reasonable that the result is still an 
approximately normal distribution. 


Hp=% Proportion of successes 


‘Thus the principle forming the basis of much of what we do in the following 
chapters is as follows: 

Start with a population with a given proportion 7, Take all samples of size 
n. Compute the proportion in each of these simples. Then: 


1. The set of all sample proportions will be approximately normally 
distributed. 
2. The mean of the set of sample proportions will equal 7, the population 


Proportion. 
3. The standard deviation op of the set of sample proportions will be 


approximately equal to |= 4 ==), 


Example 16.1 Suppose that 70% of all dialysis patients will survive for at least 5 

———— years. If 100 new dialysis patients are selected at random, what is 
the probability that the proportion surviving for at least 5 years 
will exceed 80%? 


Answer: The set of sample proportions is approximately normally 
distributed with mean .70 and standard deviation 


A 


Toe” = 0458 
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With a z-score of (.80~.70)/.0458 = 2.18, the probability the 
sample proportion exceeds 80% is 5000-4854 = .0146. 


In finding confidence interval estimates for the population proportion 7, 


how do we find a= a2 since m is unknown? The reasonable 


procedure is to use the sample proportion 


Finally, as with means, if we have a certain confidence that a sample 
proportion lies within a specified interval around the population proportion, 
we have the same confidence that the population proportion lies within a 
specified interval about the sample proportion (the distance from Missoula to 
Whitefish is the same as the distance from Whitefish to Missoula). 


CONFIDENCE INTERVAL ESTIMATE OF THE 
POPULATION PROPORTION 


Trample 162 If 64% of a simple of $50 people leaving a shopping mall claim 

Ive spent over $25, determine a 99% confidence interval 
te for the proportion of shopping mall customers who 
spend over $25, 


Answer: Since P = 64, the standard deviation of the set of sample 


proportions is 
~ (Bo _ 
\ 30 0205 


‘The 996 confidence interval estimate for the population propor- 
tion is 64 + 2,58(.0205) = .64 + .053. Thus we are 99% certain 
that the proportion of shoppers spending over $25 is between .587 
and .693. 


182 The Population Proportion 


[63 In a random sample of machine parts, 18 of 225 were found to be 
damaged in shipment. Establish a 95% confidence interval estimate 
for the proportion of machine parts that are damaged in shipment. 


Answer: The sample proportion isp = 18/225 = .08, and the 
standard deviation of the set of sample proportions is 


= ig 08X92) 


0181 


‘The 95% confidence interval estimate for the population propor- 
tion is ,08 + 1.96(.0181) = .08 + .035, Thus we are 95% certain 
that the proportion of machine parts damaged in shipment is 
between 045 and .115. 


Suppose there are 50,000 parts in the entire shipment. We can translate from 
proportions to actual numbers: .045(50,000) = 2250 and .115(50,000) = 5750, 
80 We can be 95% confident that there are between 2250 and 5750 defective 
parts in the whole shipment. 


United States began bombing Inq. 

a. If 832 voiced their support for this action, with what confidence 
can it be asserted that 83.2% + 3% of the adult U.S. population 
supported the decision to go to war? 


=01s 


ete 


254 Zscore 
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The relevant zscores are +.03/.0118 54, Table A gives a 
probability of .4945, so our answer is 2(.4945) = .9890. In other 
words, 83.2% + 3% is a 98.90% confidence interval estimate for 
USS. adult support of the war decision. 


b. If the adult U-S. population is 191,000,000, estimate the actual 
‘numerical support. 


Answer: Since _.802(191,000,000)_~ — 153,000,000, while 
862(191,000,000) = 165,000,000, we can be 98.90% sure that 
between 153 and 165 million adults supported the initial bombing 
decision. 


Example 165 A US. Department of Labor survey of 6230 unemployed adults 
~~ classified people by marital status, sex, and race. The raw numbers 
are as follows: 


white, 16 ys and over nonwhite, 16 ys ond over 

Married Widow/Div Single Maried  Widew/Div Sing 
Men 1090 337 1168 266, 135 503 
Women 952 43 632189 169 


a, Find a 90% confidence interval estimate for the proportion of 
unemployed men who are married 


Answer: Totaling the first row across, we find that there are 3499 
men in the survey; 1090 + 266 = 1356 of these are married, 
‘Therefore 


1.645 1.645 Z8core 


Thus the 90% confidence interval estimate is 3875 + 
1,645(.008236) = .3875 + .0135. 
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b. Find a 98% confidence interval estimate of the proportion of 
unemployed single persons who are women. 


Answer: There are 1168 + 632 + 503 + 349 = 2652 singles in 
the survey. Of these 632 + 349 = 981 are women, so 


B= 981/2652 = 3699 and 


@, ~ _L2XSow 


A = 009375 


233 233 score 


‘The 98% confidence interval estimate is 3699 + 2.33(,009375) = 
3699 * 0218. 


Example 166 An assembly-line quality check involves the following procedure: 

~~ _ A sample of size 50 is randomly picked, and the machinery is shut 
down for repairs if the percentage of defective items in the sample 
is ¢ percent or more, Find the value of ¢ that results in a 90% 
‘chance that the machinery will be stopped if, on the average, the 
percentage of defective items it is producing is 15%, 


Answer: Since = = 15, 
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‘The zscore corresponding to a 90 probability is —1.28, so ¢ = 
15 — 1.28(.0505) = .085 or 8.5%. 


SELECTING A SAMPLE SIZE 


‘One important consideration in setting up a survey is the choice of sample 
size, To obtain a smaller, more precise interval estimate of the population 
proportion, we must either decrease the degree of confidence or increase the 
‘sample size, Similarly, if we want to increase the degree of confidence, we may 
either accept a wider interval estimate or increase the sample size. Again, while 
choosing a larger sample size may seem desirable, in the real world this 
decision involves time and cost considerations. 

In setting up a survey to obtain a confidence interval estimate of the 
population proportion, what should we use for @? To answer this question, 
we first must consider how large V1 — 7) can be. Plotting various values 
of 


gives .5 for the intuitive answer. Thus is at most <= We make use 
” vn 


of this fact to determine sample sizes in problems such as Examples 16.7 and 
16.8. 


i 


‘An EPA investigator wants to know the proportion of fish that are 
inedible because of chemical pollution downstream of an offend- 
ing factory. If the answer needs to be within +.03 at the 96% 
confidence level, how many fish should be in the sample tested? 


Answer: We want 2.050; = .03. By the above remark, apis at most 
S/n, so it is sufficient to consider 2.05(.5/Vn) = .03, Algebrai- 
cally, we get Vit = 2.05(.5)/.03 = 34.17, and m = 1167.4. There- 
fore, choosing a sample of 1168 fish will give the inedible 
proportion to within +.03 at the 96% level 


—I I~ 
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Note that the accuracy of the estimate does not depend on what fraction of 
the whole population we have sampled. What is critical is the absolute size of 
the sample. Is some minimal value of 1 necessary for the procedures we are 
using to be meaningful’ Since we are using the normal approximation to the 
binomial, both np and n(1 ~ p) should be at least 5 (see Chapter 10). 


Example 168 A study is undertaken to determine the proportion of industry 
executives who believe that workers’ pay should be based on 
individual performance. How many executives should be inter- 
viewed if an estimate is desired at the 99% confidence level to 
Within +.06? To within +.03? To within +.027 


Answer: Algebraically, 2.58.5/Vn) $ .06 gives Vin = 2.58(.5)/.06 
= 215, so n = 462.25, Similarly, 2.58(.5/Vn) = .03 gives Vn = 
2.58(.5)/,03 = 43, so n = 1849. Finally, 2.58(.5/Vm) = .02 gives 
Vin & 2,58(,5)/.02 = 64.5, so n = 4160.25. Thus 463, 1849, or 4161 
executives should be interviewed, depending on the accuracy 
desired. 


‘To cut the interval estimate in half (from *.,06 to +.03), we would 
have to increase the sample size fourfold (from 462.25 10 1849). 
To cut the interval estimate to a third (from +.06 to +.02), a 
ninefold increase (from 462.25 to 4160.25) in the sample size 
would be required, 


More generally, to divide the interval estimate by d without affecting the 
confidence level, we must increase the sample size by a multiple of a2. 


EXERCISES 


1. A USA Today “Lifeline”column reported that in a survey of 500 people 39% 
said they watch their bread while it's being toasted. Establish a 90% 
confidence interval estimate for the percentage of people who watch their 
bread being toasted. 

2. Ina survey funded by Burroughs-Welcome, 750 of 1000 adult Americans 
said they didn't believe they could come down with a sexually transmitted 
disease (STD). Construct a 95% confidence interval estimate for the pro- 
portion of adult Americans who don't believe they can catch a STD. 

3. During the 1991 recession, a random sample of 130 of a bank's 13,000 
home mortgage accounts showed 26 such loans delinquent in payments, 
a. Determine a 99% confidence interval estimate for the proportion of 

home mongage accounts that are behind in their payments. 
b. Translate this answer into an estimate of the actual number of delin- 
quent accounts. 


Confidence Interval of the Proportion 


4, In a random sampling of 450 hospital deaths, the following numbers were 
noted with regard to underlying causes. 


Couse of Death 
Heart Motor vehide 
disease Concer ‘cident other 
‘Adult males 12 2 18 Fa 
Adult females 8 o 16 15 
Children 8 R 1s 10 


a. Establish a 98% confidence interval estimate for the proportion of all 
deaths due to heart disease. 

b, Establish a 94% confidence interval estimate for the proportion of 
female deaths due to cancer. 

¢. Establish an 80% confidence interval estimate for the proportion of 
motor vehicle deaths which involve children. 

5. A 1993 Los Angeles Times poll of 1703 adults revealed that only 17% 
thought the media was doing a “very good” job, With what degree of 
confidence could the newspaper say that 17% + 2% of adults believe the 
media is doing a “very good” job? 

6, Suppose that, before marketing a new “improved” laundry detergent, an 
advertising agency plans to simple 60 consumers and check whether at 
least c% notice an improvement. Find the value of ¢ that results in a 95% 
chance of acceptance of the detergent if, in realty, 75% of all consumers 
can detect an improvement. 

7. A politician wants to know what percentage of the voters support her 
position on the issue of forced busing for integration, What size voter 
‘sample should be obtained to determine with 90% confidence the support 
level to within 4942 

8. In a 1994 New York Times poll about a candidate's popularity, the news- 
paper claimed that in 19 of 20 cases its poll results should be no more than 
3 percentage points off in either direction. 

a. What confidence level are the pollsters working with? 
'b, What size sample should they have obtained? 


Study Questions 
1. Suppose that, of a group of six (WV = 6) people: A, B, GD, E, and F, two 
(Band B own their own houses. Thus the proportion of the group owning 
their homes is = 2/6 = 1/3, Now consider all possible samples of size 
4 
4. List the samples [there are C(6,4) = 15], and determine the proportion 
Poof home owners for each sample 
b. Calculate the mean and standard deviation for this set of 15 proportions. 
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‘Show that the mean is equal to 7, and that the standard deviation is 


caualto [OD ae 
©. What happens to /™¢ 


large? 

2. To maximize Vm(l ~ 2), it is sufficient to maximize (1 — ) =m ~ m2 
Let f'n) = w — w?, and use elementary calculus to show that f takes its 
maximum value when a = .5. Note that the value of Vm(1 = m) att = .5 
is .5. Alternatively, graph the parabola y= — a, and note that the vertex 
isatm = 5, 

3, Pollsters often express results in a form such as 43% plus or minus 5%, 
a, What critical piece of information is missing from such a statement? 
b. How does this omission affect the usefulness, meaning, or strength of, 

such statements? 


X= ifthe population size Nis very 


17 
HYPOTHESIS TEST OF THE 
PROPORTION 


Closely related to the problem of estimating a population proportion is the 
problem of testing a hypothesis about a population proportion. For example, 
a travel agency might determine an interval estimate for the proportion of 
sunny days in the Virgin Islands or, alternatively, might test a tourist bureau's 
claim about the proportion of sunny days. A major stockholder in a construc- 
tion company might ascertain an interval estimate for the proportion of 
successful contract bids or, alternatively, might test a company spokesperson's 
claim about the proportion of successful bids. A social scientist might find an 
interval estimate for the proportion of homeless children who attend school or, 
alternatively, might test a school board member's claim about the proportion 
of such children who are still able to go to classes. In each of these cases, the 
researcher must decide on whether the interest lies in an interval estimate of 
population proportion o in a hypothesis test of a claimed proportion. 

‘The general testing procedure is very similar to that developed for testing a 
claim about a population mean. Again, there is a specific hypothesis to be 
tested, called the mull bypothesis, which is stated in the form of an equality 
statement about the population proportion (for example, H,: = .37). There 
is the alternative bypotbesis, stated in the form of an inequality (for example, 
Hy: ™ < 37 oF Hy: ® > 37 oF Hy: ® * 37). The null hypothesis cannot be 
proved incorrect with absolute certainty; rather, it may be shown to be 
improbable. 

‘The testing procedure involves picking a simple and comparing the sample 
proportion to the claimed population proportion 7. A critical value (or 
titical values) cis (are) chosen to gauge the significance of the sample statistic 
If the observed B is further from the claimed proportion = than is the critical 
value, we say that there is sufficient evidence to reject the null hypothesis. For 
example, if H,:  <.37 and if ¢ = 33, a sample = .35 would not be sufficient 
evidence to reject Hy, 

In practice, an acceptable a-risk, or probability of committing a Type I error 
and mistakenly rejecting a true null hypothesis, is chosen. This a-risk, also 
called the significance level of the test, is used to determine the critical value(s). 
‘The strength of the sample statistic B can further be gauged through its 
associated p-value which gives the smallest value of a for which H,, would be 
rejected. Finally, the possibility of a mistaken failure to reject a false null 
hypothesis. This is called a Type Terror and has associated probability B. There 
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is a different value of B for each possible correct value for the population 
parameter ™. 


TYPE I ERRORS, CRITICAL VALUES, 
p-VALUES 


Temple 17.1 A local restaurant owner claims that only 15% of visiting tourists 

stay for more than 2 days. A chamber of commerce volunteer is 
sure that the real percentage is higher. He plans to survey 100 
tourists and intends to speak up if at least 18 of the tourists stay 
longer than 2 days. What is the probability of a Type I error? 


Answer: 
He m= 15 
Hy ® > 15 


Fall to reject Hy 


Reject H, 


Proportions: 


0 08s score 


Using the claimed 15%, we calculate the standard deviation of 


sample proportions to be 
[asx85) 
100 


‘The critical proportion is c = 18/100 = .18, so the critical z-score 
is (.18 — .15)/.0357 = 0.84. Thus a = .5000 — 2995 = .2005, The 
test, as set up by the volunteer, has a 20.05% chance of mistakenly 
rejecting a true null hypothesis. 


9; 0357 


A union spokesperson claims that 75% of union members support 
a strike if their basic demands are not met. A company negotiator 
believes the true percentage is lower and runs a hypothesis test at 
the 10% significance level. What is the conclusion if 87 of 125 
union members say they will strike? 
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Answer: 
Hy w= 75 
Hg <5 
Faillto reject H, 
Reject Hy 
7004 75 Proportions. 
+ + 
“1.28 ° score 


We use the claimed proportion to calculate the standard deviation 
of the sample proportions. 


= [55 _ 
o5 Be = 03873 


With a = 10 the critical z-score is ~1.28, so the critical proportion 
is .75 — 1.28(.03873) = .7004. The observed sample proportion is 
B= 87/125 = .696. Since ,696 < 7004, there is sufficient evidence 
to reject H, at the 10% significance level. The company negotiator 
should challenge the union claim. 

To measure the strength of the disagreement between the 
sample proportion and the claimed proportion, we calculate the 
p-value (also called the attained significance level). The zscore 
for .696 is (.696~.75)/.03873 = ~1.39 with a resulting pvalue of 
5 — ANT7 = .0823. There is not sufficient evidence to reject H,, 
at the 5%, or even the 8%6, significance level 


A cancer research group surveys 500 women over 40 years old to 

test the hypothesis that 28% of this age group have regularly 

scheduled mammograms. 

a. Should the hypothesis be rejected at the 5% significance level 
if 151 of the women respond affirmatively? 


Answer: Since no suspicion is voiced that the 28% claim is low 
or high, we run a two-sided (also called two-tailed) test. 
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0201 


Fail to reject H, 
Reject H, Reject H, 
2a 28 39 Proportions 
. - + 
196 ° 1.96 score 


With a/2 = .025 in each tail, the critical zscores are +1.96, and 
the critical proportions are .28 + 1.96(.0201) = .241 and .319. The 
observed p = 151/500 = .302. Since .302 is not outside the critical 
values, there is not sufficient evidence to reject #5 that is, the 
cancer research group should not dispute the 28% claim, 


b, What is the p-value? 


Answer: The =score for 302 is (302~.28)/.0201 = 1.09, which 
corresponds to a probability of 5000 ~ .3621 = .1379. Doubling 
(because the test is two-sided), we obtain a p-value of 2(.1379) 
2758. Thus, for example, the null hypothesis should not be 
rejected even if a were a relatively lange .25. 


TYPE I ERRORS 


Why not always choose a to be extremely small so as to eliminate the 
possibility of mistakenly rejecting a correct null hypothesis? The difficulty is 
that this choice would simultaneously increase the chance of never rejecting 
the null hypothesis even if it were far from true. Thus, for a more complete 
picture, we must also calculate the probability B of mistakenly failing to reject 
a false null hypothesis. As was the case with means, there is a different value 
B for each possible correct value of the population proportion @. The operating 
characteristic curve, or OC-curve, a graphical display of B values, is often given 
in the reabife analysis of a hypothesis test 
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TAA soft-drink manufacturer received a 9%6 share of the market this 
past year, The marketing research department plans a telephone 
survey of 3000 households. If fewer than 8% of respondents 
indicate they will buy the company’s product, the research depart- 
ment will conclude that the company’s market share has dropped 


and will order special new promotions. 
a. What is the probability of a Type I error? 
‘Answer: 
Hy ® = 09 
Hy ® <9 
«5° osu = 005225 
Fall to reject H, 
ject Hy 
os ry Mant share 
191 ° score 


‘The zscore for .08 is (08 ~ .09)/.005225 = -1.91, soa = 5 — 
A719 = 0281. 


b. What is the probability of a Type Il error if the true market 
share is .085? In other words, if the market share really has 
dropped to 8.5%, what is the probability that the research 
department will mistakenly fail to reject the 9% null hypothesis? 


Answer: The score for .08 now is (.08 ~ .085)/,005225 = —0.96. 
‘Thus B = .5000 + 3315 = 8315. 


Fal to reject H, 
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¢. What if the true market share is .075? 


Answer: The =score for .08 now is (.08 — .075)/.005225 = 0.96. 


Fall to reject H, 
O75 08 Market share 
° 096 score 


Note that the further the true value is (in the suspected direction) from the 
claimed null hypothesis, the smaller the probability is of failing to reject the 
false claim. 


Example 175 A building inspector believes that the percentage of new construc- 
tion with serious code violations may be even greater than the 
previously claimed 7%. A hypothesis test is planned on 200 new 
homes at the 10% significance level. What is the value of B if the 
true percentage of new constructions with serious violations is 9%? 
Is 1196 Is 1392 


Answer: 


12 Proportions 
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‘The 10% significance level gives a critical z-score of 2.33 and a 


critical proportion of .07 + 2.33(.018) = .112. 
Fall to reject H, 
Reject Hy 
09 2 Proportions 
— 
° 122 score 


If the true percentage is 9%, the Escore for 112 is (112 — 
09)/,018 = 1,22. Then B = .5000 + 3888 = B88. 


Fall to roject H, reject Hy 

MM Proportions 
ooo 

oon z8core 


If the true percentage is 11%, the z-score for 112 is (112 — 
-11)/.018 = 0.11, Then B = .5000 + .0438 = 5438. 


Fall to reject Hy, Reject H, 
m2 13 Proportions 
oo 


If the true percentage is 13%, the z-score for .112 is (112 - 
.13Y/,018 = —1. Then B = 5000 ~ 3413 = 1587. 
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‘A tabulation of these values (and a few more) for B is as follows: 


Tue | 08 | 09 
pT amas Tame Tas | 456 1 s300 1 ase7 Tose 1 oir 


OPERATING CHARACTERISTIC AND 
POWER CURVES 


‘The resulting graph is called the operating characteristic (OC) curve. 


100 
ns 
tN 
: 

ss . 1 ee was 


If is the probability of failing to reject a false null hypothesis, then 1-B 
the probability of rejecting the false null hypothesis. The power of a 
hypothesis testis the probability that a Type II error is not committed, and the 
graph of 1=B, shown below, is called the power curve. 


1.00 
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Both the operating characteristic curve and the related power curve help us 
gauge the effectiveness of a hypothesis test. 


EXERCISES 


1. In an effort to curb certain diseases, especially AIDS, San Francisco has a 
program whereby drug users can exchange used needles for fresh ones. 
As reported in the Journal of the American Medical Association January 
12, 1994), 35% of 5644 intravenous drug users in San Francisco admitted 
to sharing needles. Is this sufficient evidence at the 1% significance level 
to say that the rate has dropped from the pre-needle-exchange rate of 66%2 

2, In 1993 it was estimated that 1,000,000 of 141,000,000 adults (0.7196) in the 
United States were HIV positive. A later study, reported in The New York 
Times, found 29 HIV positive adults in a sample of 7782. Is this sufficient 
evidence to dispute the 0.71% claim at the 5% significance level? 

3, The 1948 Kinsey study of human sexuality claimed that 10% of Americans 
are homosexuals. In a University of Chicago study, reported in Tbe New 
York Times, of 3432 people, approximately 2.19% identified themselves as 
homosexuals. 

a, What is the p-value? 
b. Is this sufficient evidence to reject the Kinsey study at the 1% signifi- 
cance level? 


4, According to the Las Angeles Times, the president of the National Coffee 
Association has claimed that “caffeine as normally consumed poses no 
health risk ...and cannot be described as a substance of dependence.” 
Researchers from Johns Hopkins University reported that in a sample of 
27 coffee drinkers 16 displayed all the signs that define classic depend- 
ence. 

a, What is the p-value? 

b. Is this sufficient evidence to reject the coffee association president's 
claim at the 2% significance level? 

5. One nationwide report on physical fitness claims that the proportion of 
middle-level managers who participate in a regular exercise program is .13. 
A company considering the provision of free health club memberships for 
its employees plans to survey 200 managers. The new program will be 
installed immediately if the proportion of exercisers is less than .10. 

a. What is the probability of a Type I error, that is, of rushing ahead when 
there really is no difference in managers’ exercise participation from the 
nationwide proportion? 

'b. What is the probability of a Type II error if the true company proportion 
is 12? If it is 0 

6. A test is planned at the 10% significance level to determine whether more 
than 40% of persons eligible for food stamps make full use of the program. 
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If 500 eligible people are interviewed, determine the probability of a Type 
I error if the true percentage of users is 42%, What if itis 44967 

7. A business magazine article states that 82% of college graduates majoring 
in international economics find work in their field. A guidance counselor 
believes that the true figure is lower and runs a hypothesis test on 230 
such graduates at the 5% significance level. 
a. Find the value of B for each of the following eight possible true 

percentages: 83%, 84%, ..., 90%. 

b, Sketch the operating characteristic curve. 


Study Questions 

1, Suppose that your rival in the marketing office has what you consider to 
be a dumb idea. Your boss, who knows nothing about statistics, is 
relatively conservative but will follow up on this dumb idea if there is good 
evidence that more than 25% of the company’s customers like it, A survey 
of 40 customers is taken, and 12 of them favor the idea. Your rival points 
out that 12/40 = 30%6. You know a little about statistics and perform the 
usual hypothesis test with Hy: © = .25 and Ay: > .25. Write a paragraph 
explaining to your boss why these data are not strong evidence that more 
than 25% of the customers like your rival's idea 

2. (Type II error in a two-tailed test) It is estimated that 60% of all grocery 
store customers clip, save, and use advertisers’ money-off coupons. A store 
manager surveys 300 customers and runs a test at the 5% significance level. 
What is the value of B if the true percentage of customers using coupons 
is 53042 58962 63007 68942 
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DIFFERENCES IN POPULATION 
PROPORTIONS 


Numerous important and interesting applications of statistics involve the 
comparison of two population proportions. For example, is the proportion 
of satisfied purchasers of American automobiles greater than that for buyer 
of Japanese cars? How does the percentage of surgeons recommending a 
new cancer treatment compare with the corresponding percentage of oncolo- 
gists? What can be said about the difference between the proportion of single 
Parents who are on welfare and the proportion of two-parent families on 
welfare? 

Our procedure involves comparing two sample proportions. When is a 
difference between two such sample proportions significant?” We note that we 
are dealing with one difference from the set of all possible differences obtained 
by subtracting sample proportions of one population from sample proportions 
of a second population. To judge the significance of one particular difference, 
‘we must first determine how the differences vary among themselves, As in the 
similar discussion regarding the comparison of two means, a necessary tool 
from Chapter 2 is that the variance of a set of differences is equal to the sum 
of the variances of the individual sets; that is, 


=o} +o} 
aa) = mD 
Nowito, = [EE=B ang, = [OD 
™ J 
pe MOD RO) ay 
eS ee 
o.* LAG =) , mo 2 


As in the preceding chapters on proportions, we are using the normal to 
estimate the binomial, so we will assume that 


mm, m1— 7), mm, and n,.— 7) 


are all at least 5. 
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CONFIDENCE INTERVAL ESTIMATES 
FOR DIFFERENCES 


Often it is clear that some difference exists between two population 
proportions, and we would like to find a numerical measure of the difference, 
Using samples, we cannot determine this difference exactly, however, we can 
say, with a specified degree of confidence, that the difference lies in a certain 
interval, We follow the same procedure as set forth in Chapter 16, this time 
using 


Trample TET Suppose that 84% of a sample of 125 nurses working 7 AM. to 3 

—————_ PM. shifis in city hospitals express positive job satisfaction, while 
only 7296 of a sample of 150 nurses on 11 PM, to 7 A.M, shifts 
express similar fulfillment. Establish a 90% confidence interval 
estimate for the difference, 


Answer: 


in, = 125, m, = 150 
By = 84, By = 72 


= 0492 


1.645, 1.645, pscore 


‘The observed difference is 84 — .72 = .12, and the critical 
zscores are +1.645. The confidence interval estimate is .12 + 
1.645(.0492) = .12 + .081. We can be 90% certain that the 
proportion of satisfied nurses on 7 to 3 shifts is between .039 and 
201 higher than for the proportion for nurses on 11 to 7 shifts. 
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Example 182 A grocery store manager notes that, in a sample of 85 people 

—~——_ going through the “under seven items” checkout line, only 10 paid 
with checks; whereas, in a sample of 92 customers passing 
through the regular line, 37 paid with checks. Find a 95% confi- 
dence interval estimate for the difference between the proportion 
of customers going through the two different lines who use 
checks. 


Answer: 


1.98 1.98 pacor8 


‘The observed difference is 118 — 402 = —.284, and the critical 
zécores are +1.96. Thus, the confidence interval estimate is ~.284 
‘+ 196.0619) 284 + 121. The manager can be 90% sure that 
the proportion of customers passing through the “under seven 
items” line who use checks is between .163 and .405 lower than 
the proportion going through the regular line who use checks, 


HYPOTHESIS TESTS 


For many problems the null hypothesis states that the population propor- 
tions are equal, or, equivalently, that their difference is 0: 


Hy — m= 0 
‘The alternative hypothesis is then 


Hg. — %2 <0, He % — Fz > 0, or Hy: % — 72 ¥ 0, 
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where the first two possibilities lead to one-sided (one-tailed) tests, while the 
third possibility leads to two-sided (two-tailed) tests. 

Since the null hypothesis is that =, = 7, we call this common value m, and 
use it in calculating o¢ 


fm =m) , w—m) _ sa - {+ 1) 
7 a 1 mG, 


In practice, if 
= % - 
i-S om b 

we use 
ponte 
m+n; 


as an estimate for w in calculating oy. 


Suppose that, early in an election campaign, a telephone poll of 
800 registered voters shows 460 in favor of a particular candidate, 
Just before election day, a second poll shows only 520 of 1000 
registered voters expressing the same preference. At the 10% 
significance level is there sufficient evidence that the candidate's 
popularity has decreased? 


‘Answer: 
Hem -m= 0 == 575 
He ™-%>0  _ 529 
a=10 Ps = yoo = 


Fiejoct H, 
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With @ = .10, the critical zscore is 1.28, so the critical difference 
is 0 + 1.28(.0236) = .0302. The observed difference is 575 — .520 
= .055. Since .055 > .0302, we conclude that at the 10% signifi- 
cance level the candidate's popularity bas dropped. 

The strength of this drop can be further measured by calculat- 
ing the p-value. The z-score for .055 is (.055—0)/.0236 = 2.33, so 
the p-value is 5000 — 4901 = .0099. Thus there is sufficient 
evidence of a drop in popularity even at the 1% significance level. 


An automobile manufacturer tries two distinct assembly proce- 
dures. In a sample of 350 cars coming off the line using the first 
procedure, there are 28 with major defects, while a sample of 500 
autos from the second line shows 32 with defects, 

a. Is the difference significant at the 696 significance level? 


Answer: Since there is no mention that one procedure is believed 
to be better or worse than the other, this is a two-sided test. 


eject Ho Rooject Ho 
=0337 ° 0337 Ditierences 
+ + 
1.88 ° 1.88 ‘ascore 


With a = .06, we put .03 in each tail; the critical zscores are 
‘£1.88. The critical differences are 0 + 1.88(.0179) = +.0337. The 
observed difference is .080 — .064 = .016. Since .016 is between 
the two critical values, we conclude that the observed difference 
is not significant at the 69 level. 
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Exomple 185 


b. What is the p-value? 


Answer: The score for .016 is (0 — .016)/.0179 = 0.89, which 
corresponds to a tail probability of 5000 — .3133 = .1867. 
Doubling because the test is two-sided results in a pvalue of 
2.1867) = .3734, Thus the smallest significance level for which 
the observed difference is significant is a very large 37.34%. 


A survey of 5000 medical students compared the career goals of 
men and women, 
oreer Goal 
Surgery Gynecalogy Pediatrics Paychatry Other 
Men 32 520 472 610 1026 
Women 128 350 31 400 71 


a. Establish a 95% confidence interval estimate for the difference 
between the proportions of men and women intending to 
become surgeons. 


Answer: 
ae 312 irae 

A ~ 5ir¥ 520 + 472 + G10 + 1026 ~ 2940 ~ 16 

128 128 062 


Ps ~ TRF 350 + 391 + 400 + 791 2060 


‘The observed difference is 106 — .062 = .044, so the confidence 
interval estimate is .044 + 1.96(.00778) = .044 + .015. Therefore 
‘we can be 95% certain that the proportion of men heading toward 
a career in surgery is between .029 and .059 higher than the 
proportion of women. 
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1b, Is the observed difference between the proportions of men and 


women hoping to become gynecologists significant at the 2% 
significance leve?? 


ou cara 264 st + xs) = 0109 


allt reject My 


025 ° (025 Ditorences 


2s ° 239 score 


The critical z-scores are +2.33, so the critical differences are 0 + 
2.3K.0109) = +.025. The observed difference is .177 ~ .170 = 
.007. Since .007 is between —.025 and .025, there is not sufficient 
‘evidence to reject H,. The observed difference between the 
proportions of men and women with gynecology as a career goal 
is not significant at the 2% level. 


c. Is there sufficient evidence at the 0.59% significance level that a 
higher proportion of women want to become pediatricians than 


Fall to reject Hy 
Roject Ho 
Diferences ° 028 
- 
Peco ° 2575 


‘The critical zscore is 2.575, so the critical difference is 0 + 
2.575(.0109) = .028. The observed difference is 190 ~ .161 = 
029. Since .029 > .028, there is sufficient evidence to reject Hy. 
At the 0.5% significance level there is evidence that a greater 
proportion of women than men plan to become pediatricians. 


SELECTING A SAMPLE SIZE 


Setting up experiments or surveys involves many considerations, one of 
which is sample size. For example, one topic in this chapter involves making 
interval estimates of differences in population proportions. Generally, if we 
‘want smaller, more precise interval estimates, we either decrease the degree of 
confidence or increase the sample size. Similarly, if we want to increase the 
degree of confidence, we may either accept a wider interval or again increase 
the sample size 

In Chapter 16 we noted that Vag = =) is at most .5. Thus 


a a{ +2) = ON 


Now, if we simplify by insisting that 7, = 1; = 1, the above statement reduces 
as follows: 


T&6 A pollster wants to determine the difference between the propor 

——— tions of high-income voters and low-income voters who support 
a decrease in capital gains taxes. If the answer needs to be known 
to within +.02 at the 95% confidence level, what size samples 
should be taken? 


Answer: Assuming we will pick the same size samples forthe two 
sample proportions, we have og = .5V2/Vn and 1.9604 = .02 


Differences in Population Proportions 207 


Thus, 1.96.5V2/Vn = .02. Algebraically we find that Vn = 
1,96(.5)N2/,02 = 69.3. Therefore, n = 69.3? = 4802.5, and the 
pollster should use 4803 people for each sample. 


pines reported (Science, March 
11, 1994, page 1491) that 210 of 361 members in biology are women, but 
only 34 of 86 members in mathematics are women, Establish a 95% 
confidence interval estimate for the difference in proportions between 
‘women in biology and women in mathematics in the Philippines, 


2. In a random sample of 300 elderly men, 65% were married, while in a 
similar sample of 400 elderly women, 48% were married. Determine a 99% 
confidence interval estimate for the difference between the percentages of 
elderly men and women who are married. 


3, Suppose that 23 of 94 oil company executives recommend selling short on 
major oil stocks, while 34 of 80 stockbrokers recommend such sales, 
Construct a 90% confidence interval estimate for the difference between 
the percentages of oil executives and stockbrokers who recommend 
selling oil stocks short. 


4. The Baldus-Woodworth 5 year study of Georgia's death penalty system 
noted that 50 of 228 black defendants received the death penalty after 
conviction for murdering a white person, while 2 of 64 white defendants 
received the death penalty for murdering a black person. Is there sufficient 
evidence at the 5% significance level to say that a higher proportion of 
blacks than whites receive the death penalty in Georgia when the victim 
is of the other race? 


5. Tversky and Gilovich, in their study on the “Hot Hand” in basketball 
(Chance, Winter 1989, page 20), found that Larry Bird hit a second free 
throw 48 of 53 attempts after the first free throw was missed, and hit a 
second free throw 251 of 285 attempts after the first free throw was made. 
Is there sufficient evidence at the 5% significance level to say that the 
probability Bird makes a second free throw is different depending on 
whether or not he made the first free throw? 
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6. The results of a double-blind study on whether AZT taken by pregnant 
HIV-positive women helps prevent the virus from being passed to the fetus 
Rave seemingly strong results (New England Journal of Medicine, Novem 
ber 1994). Of 240 women who took the drug, 8.3% of the children were 
infected with HIV, while of 240 women given a placebo, 25.5% of the 
children were infected. Is there sufficient evidence at the 19% significance 
level to say that AZT helps prevent the HIV virus from being passed from 
mother to child? 

7. The New York Times (April 2, 1993, page A12) reported that in a double- 
blind study of newly diagnosed HIV patients, 92% of 875 patients treated 
with AZT survived the initial 3 years, while 93% of 875 patients given a 
placebo were alive 3 years after the diagnosis. Is the difference in survival 
rates statistically significant? 


8, A government survey of income levels of men and women yielded the 
following numbers for the 7689 person random sampling. 
Earning Group 
Under $10,000 $10-20,000——$20-50,000 Over $50,000 
Men 383 7 2599 551 
Women a3 1371 1084 51 


a, Establish a 96% confidence interval estimate for the difference between 
the proportions of men and women who earn between $20,000 and 
$50,000, 

'b, Construct a 98% confidence interval estimate for the difference between 
the proportions of persons earning over $50,000 and those earning 
under $10,000 who are men. 

¢. Test the claim that a greater proportion of women than men earn under 

$10,000 at the .5% significance level. 

Is there a significant difference between the proportions of men and 

‘women earning between $10,000 and $50,000 at the 2.5% significance 

level? 

9. A researcher plans to investigate the difference between the proportion of 
psychiatrists and the proportion of psychologists who believe that most 
emotional problems have their root causes in patients’ childhoods. How 
large a sample should be taken (same number for each group) so as to be 
90% certain of knowing the difference to within +.03? 


a 


Study Questions 

1. The hypothesis tests in this chapter all began with the null hypothesis A, 
‘t, — 7 = 0. However, we can easily modify our procedure to handle Hg: 
‘, — 2 = k for some nonzero constant & In this case we simply use 
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6. as we did for confidence intervals. For 


example, suppose that 210 of 3600 alumni responded to a mail solicitation, 

while 476 of 2700 responded to a telephone appeal for funds. Does this 

support the claim that the proportion of successful telephone appeals 

exceeds the proportion of successful mail solicitations by more than .10? 

Test at the 2.5% significance level. 

. Three in-store surveys are taken: one before, one during, and one after a 
special promotion for tropical fruits. The proportions of customers express- 
ing interest in these products are 29 of 157, 37 of 165, and 40 of 152 for 
the surveys before, during, and after the promotion, respectively. 

a, At the 10% significance level, is the interest increase from before to 
during the promotion significant? From during to after the promotion? 
From before to after the promotion? 

b. What does the result obtained in part a say about the possibility that 
By and Py are significantly different when P, and P, are not, and p; and 
Pyare now 

If 


a - eee: 
AS, fa, at pe AAS, 
is it ever true that 


Lint: Make reasonable guess about a simple relationship between 1, 
and m, see whether } = ¥2 (p, + P,) holds true in this case when you 
plug in some arbitrary numerical values, and then try to show why this 
equation is true in general] 


Part 6 


CHI-SQUARE ANALYSIS 


In Chapters 19.and 20 we consider two types of problems involving 
similar analyses. The frst concerns whether or not some observed 
distributional outcome fits some previously specified pattern, Perfect 
“fits very rarely exist, and we must develop a procedure for measuring 
the significance of a loose fit. The second type of problem concerns 
whether two variables are independent or have some relationship 
Here we use the same kind of measurement as for the first type to 
Judge the significance of a loose fit with the theoretical pattern based 
on independence. The kinds and the variety of problems we will be 
considering are indicated by the following examples 

A geneticist might test whether inberited traits can be explained 
by a binomial distribution. A physicist might look to a Poisson 
distribution fo further understand alpba particle emissions. A psy- 
chologist might study buman intelligence patterns in terms of a 
normal distribution. Jn these and many other applications, the 
question arises as to how well observed data fit the pattern expected 
from some specified distribution. 

A pollster might look at whether or not voter support for a 
candidate is independent of the voters’ etbnic backgrounds. An 
efficiency expert might examine whether or not the likelibood of an 
accident is independent of the shift on which an employee works. 
A psychologist might test whether or not certain mental abnormali- 
ties are independent of socioeconomic background. 
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As can be seen, the procedure described in the Chapters 19 and 
20 bas a wide range of applications. Chi-square analysis is one of 
the most useful statistical techniques, both in cases where other tests 
are not applicable, and in cases where other tests may be unneces- 
sarily complicated. 


19 
CHI-SQUARE TEST FOR 
GOODNESS-OF-FIT 


‘Among the many theoretical distributions analyzed by statisticians are the 
binomial, the Poisson, the normal, and the Student-f, all of which we have 
made use of, In addition, numerous other distributions are expected or looked 
for in specific real-world situations. The critical question is whether or not an 
observed pattern of data fits some given distribution, A perfect fit cannot be 
expected, so we must be able to look at discrepancies and make judgments as 
to the goodness-of-fit. 

‘Our approach is similar to that developed earlier. There is the null hypothe- 
sis of a good fit, that is, the hypothesis that a given theoretical distribution 
correctly describes the situation, problem, or activity under consideration, Our 
observed data consist of one possible sample from a whole universe of 
possible samples. We ask about the chance of obtaining a sample with the 
‘observed discrepancies if the null hypothesis is really true. Finally, if the 
chance is too small, we reject the null hypothesis and say that the fit is not a 
good one. 

How do we decide about the significance of observed discrepancies? It 
should come as no surprise that the best information is obtained from squaring 
the discrepancy values, as this has been our technique for studying variances 
from the beginning. Furthermore, since, for example, an observed difference 
of 23 is more significant if the original values are 105 and 128 than if they are 
10,602 and 10,625, we must appropriately weight cach difference. Such weight- 
ing is accomplished by dividing each difference by the expected value. The 
‘sum of these weighted differences or discrepancies is called cht-square and is 
denoted as x? (x is the lower-case Greek letter chi): 


ge = pO ep? 


exp 


‘The smaller the resulting x?-value, the better is the fit. If the x2-value exceeds 
some critical value, we say there is sufficient evidence to reject the null 
hypothesis and to claim that the fit is poor. 

To decide how large a calculated x¢-value must be to be significant, that is, 
to choose a critical value, we must understand how x2-values are distributed. 
A x*-distribution is not symmetrical and is always skewed to the right. Just as 
with the Fdistribution, there are distinct x?-distributions each with an associ- 
ated df value (number of degrees of freedom). The larger the df value, the 
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closer the x2distribution is to a normal distribution. Note, for example, that 
squaring the often used zscores 1.645, 1.96, and 2.576 results in 2.706, 3.841, 
and 6.635, which are entries found in the first row of Table D, the y#-distribu- 
tion table, in the Appendix. 


A large value of x? may or may not be significant—the answer depends on 
which x4distribution we are using. As with the distribution, in Table D in the 
Appendix we give critical y2-values for only the more commonly used percent- 
‘ages or probabilities. To use the x2-distribution for approximations in good- 
ness-of-fit problems, the individual expected values cannot be too small, An 
often-used rule of thumb is that no expected value should be less than 5. 


GOODNESS-OF-FIT TEST FOR PRIOR 
DISTRIBUTION 


Example 19.1 In a recent year, at the 6 P.M. time slot, television channels 2, 3, 
————_ 4, and 5 captured the entire audience with 30%, 25%, 20%, and 
25%, respectively. During the first week of the next season, 500 
viewers are interviewed. 
a. If viewer preferences have not changed, what number of 
persons is expected to watch each channel? 


Answer: 30500) = 150, .25(500) = 125, .20(500) = 100, and 
.25(500) = 125, so we have these results: 


Channet 
2 3 4 


Expected number 150 | 125 | 100 | 125 


ChiSquore Test for Goodnessoffit 215 


b. Suppose that the actual observed numbers are as follows: 


Channel 
2 3 4 s 
Observed number 139 | 138 | a2 | an 


Do these numbers indicate a change? Are the differences signifi- 
cant? 


Answer: We calculate: 
y= rset 
exp 


= 039 = 150%, (138 - 125% , (112 - 100) 


125 100 


Is 5.167 large enough for us to reject the null hypothesis of a good 
fit between the observed and the expected numbers? To use Table 
D in the Appendix, we must decide upon an a-tisk and then 
calculate the df value. The number of degrees of freedom for this 
problem is 4 — 1 = 3. (Note that, while the observed values of 
139, 138, and 112 can be freely chosen, the fourth value, 111, is 
absolutely determined because the total must be 500; thus df'= 3.) 


Fail to reject H 


Reject Ho 


6251 Pa 


Thus, for example, at a 10% significance level, with df = 3, the 
critical x2 is 6.251. Since 5.167 < 6.251, there is not sufficient 
evidence to reject H,. 
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Fall to reject Ho 


Reject Ho 


4842 ee 


However, with df= 3 and a 20% significance level, the critical x2 
is 4.642. Since 5.167 > 4,642, there is sufficient evidence to reject 
H,, At the 20% level we would conclude that the fit is not good, 
and that viewer preferences have changed. 


Trample 192 A geneticist claims that four species of fruit flies should appear in 
the ratio 1:3:3:9. Suppose that a sample of 4000 flies contained 
226, 764, 733, and 2277 flies of each species, respectively. At the 
10% significance level, is there sufficient evidence to reject the 
geneticist’s claim? 


Answer: Since 1 + 3 + 3 + 9 = 16, according to the geneticists 
the expected number of fruit flies of each species is as follows: 


1 


é 3 2 
74000) = 250, 3.4000) = 750, 
& 4000) = 750, 2 4000) = 2250 
16 16 


‘We calculate chi-square: 


x2 = GH= BOP , 764 — 750%, 733 — 750% 


250 730 750 
(227 ~ 2250 
+ 27 


With 4 — 1 = 3 degrees of freedom and a = .10, the critical x? 
value is 6.251. Since 3.27 < 6.251, there is not sufficient evidence 
to reject H,. At the 10% significance level, therefore, the geneti- 
cist’s claim should not be rejected. 
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GOODNESS-OF-FIT TEST FOR UNIFORM 
DISTRIBUTION 


Eronple 193 A grocery store manager wishes to determine whether a certain 
product will sell equally well in any of five locations in the store, 
Five displays are set up, one in each location, and the resulting 
numbers of the product sold are noted. 


Location 
1 2 3 4 5 
‘Actual number sold 3 |» | 2 | uw | #8 


Is there enough evidence that location makes a difference? Test at 
both the 59% and 10% significance levels. 


Answer: 4 (otal of 43 + 29 + 52 + 34 + 48 = 206 units were 
sold. If location doesn't matter, we would expect 206/5 = 41.2 
units sold per location (uniform distribution). 


Location 
1 2 3 4 5 


Expected number sold | 41.2 | 412 | a2 | 422 | 412 


43412 , 9 412" , 52 412) 
a2 a2 ae 
A= 412» | (48 = 412 
aR cy itd 


Thus x? = 


As in the previous examples, the number of degrees of freedom 
is the number of classes minus 1; that is, df = 5 ~ 1 = 4. 


#H: good fit to uniform distribution 
Fall to reject Hy 


Reject Hy 


9488 Ps 
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At the 5% level, with df = 4, the critical y2-value is 9.488, while 
at the 10% level the critical value is 7.779. Since 8.903 < 9.488 but 
8,903 > 7.779, there is sufficient evidence to reject H, at the 10% 
evel, but not at the 5% level. If the grocery store manager is 
willing to accept a 10% chance of committing a Type I error, then 
there is enough evidence to claim that location makes a differ- 
ence. 


‘A petfood manufacturer runs an experiment to determine 
whether three brands of dog food are equally preferred by dogs. 
In the experiment, 150 dogs are individually presented with three 
dishes of food, each containing a different brand, and their 
choices are noted. Tabulations show that 62 dogs go to brand A, 
48 to brand B, and 45 to brand C. Is there sufficient evidence to 
say that dogs have preferences among the brands? Test at the 2.5% 
significance level. 


Answer: If dogs have no preferences among the three brands, we 
would expect 150/3 = 50 dogs to go to each dish. Thus: 


With 3 — 1 = 2 degrees of freedom and a = 025, the critical 
xe-value is 7.378. Since 4.36 < 7.378, there is not sufficient 
evidence to say that dogs have preferences among the three given 
brands. (Of course the company that manufactures brand A will 
continue to claim that more dogs prefer its food!) 
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GOODNESS-OF-FIT TEST FOR BINOMIAL 
DISTRIBUTION 


Example 195 Suppose that a commercial is nun once on television, once on the 

———— radio, and once in a newspaper. The advertising agency believes 
that any potential consumer has a 20% chance of seeing the ad on 
TY, a 20% chance of hearing it on the radio, and a 20% chance of 
reading it in the paper. In a telephone survey of 800 consumers, 
the numbers claiming to have been exposed to the ad 0, 1, 2, oF 
3 times are as follows. 


Exposures to Ad: 
0 1 2 3 

Observed number 

of people 44 | 320 | 35 2 


At the 196 significance level, test the null hypothesis that the 
number of times any consumer saw the ad follows a binomial 
distribution with = 2. 


Answer: The complete binomial distribution with # = .2 and 
n= 3 is as follows: 
PO) = (8 = 512 
M2K8) = 384 
M2P(8) = 096 
PG) = (2) = 008 


Multiplying each of these probabilities by 800 gives the expected 
number of occurrences: 5121800) = 409.6, .384(800) = 307.2, 
.096(800) = 76.8, .008(800) = 6.4 


Pay = 
PQ) 


Exposures to Ad 
eee et ee 
Expected number 5 
= 4006 | 3072 | 768 | 64 
Thus 
2 = G34 = 4096" , (329 - 3072) | 35 — 768" 
- 1096 2 768 
4 B=O _ a6 


Ot 


Chi-Square Analysis 
‘Hg: good fit to binomial distribution with t= .2 


Fall to reject H, 


Reject H, 


11.94 2 


With df = 4— 1 = 3 and @ = 01, the critical x?-value is 11.34. 
Since 28,776 > 11.34, there és sufficient evidence to reject H, and 
to conclude that the number of ads seen by each consumer does 
not follow a binomial distribution with a = .2. 


Four commercial flights per day are made from a small county 
airport, The airport manager tabulates the number of on-time 
departures each day for 200 days. 


Number of On-Time Departures 
° 1 2 3 4 
3 | 3% | 2 | 96 | 2 


Observed 
number of days 


At the 5% significance level, test the null hypothesis that the daily 
distribution is binomial. 


Answer: We are not given a value of 7 to test against, so we use 
the observed sample to find a proportion p. The number of 
on-time departures is seen to be 13(0) + 361) + 72(2) + 56(3) 
+ 23(4) = 440. Since there were 4(200) = 800 flights, we have a 
proportion of 
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The binomial distribution with = .55 and n= 4 is 
PCO) = (45) = 0410 
PQ) = 4(55X.45)3 = 2005 
PQ2) = 65545? = 3675 
PQ) = 4(55)5C.45) 2995, 
PCA) = (55) 0915 
Multiplying each of these probabilities by 200 days gives the 


expected numbers of occurrences: .0410(200) = 8.2, 2005(200) = 
40.1, .3675(200) = 73.5, .2995(200) = 59.9, .0915(200) = 18.3 


Number of On-Time Departures, 
0 1 2 3 4 
Expected 
fnutaber of days 82 | 41 | 735 | 599 | 183 


We calculate chi-square: 


a = G3 82" , (36 -40n? , 72-7357 , (56 - 599 
- a2 or 735 399 
(23-1837 _ 
+O2 4721 


H..: 8008 fit with a binomial 


Fall to roject Mo 


Reject Ho 


7815 a 


In this case the sample gives not only the total number, 200, but 
also the proportion, .55. Thus the number of degrees of freedom 
is the number of classes minus 2: df= 5 ~ 2 = 3. With a = .05, 
we have a critical x2-value of 7815. Since 4.721 < 7.815, there is 
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‘not sufficient evidence to reject H,. The observed data do not 
differ significantly from what would be expected for a binomial 
distribution. 


GOODNESS-OF-FIT TEST FOR POISSON 
DISTRIBUTION 


Example 19.7 


In a hospital study of 450 patients with a particular form of cancer, 
the following data shows the number of these patients who 
survived for each given number of years. 

Survival Years 


° 1 2 3. 4 or more 


Observed number 
of patients 


o Lwlsle]« 


‘Test the null hypothesis that the distribution follows a Poisson 
distribution with p = 2.1. Assume a 2.5% significance level. 


Answer: The Poisson probabilities are 


PCO) = e-2! = 122 

PQ) = 2.1e72) = 257 

PQ) = SP on = 270 
fe 

pe) = GR 


P(A or more) = 1 ~ (122 + 257 + 270 + 189) = 162 


Multiplying each of these probabilities by 450 gives the number 
of patients expected to survive each designated number of years: 
122(450) = 54.9, .257(450) = 115.65, 270(450) = 121.5, .189(450) 
= 85.05, .162(450) = 72.9. 


Survival Years 
Cee: 23. dormore 
of pat bs 349 | 115.65 | 121.5 | 85.05 | 72.9 
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Thus 
2 = GO= 549 , (10 = 11565", 425 ~ 12152 
39 T1865, Tas 
¢ B= 8505" , 67-729 _ 1 459 


3505 729 
H,; good fit to Poisson distribution with = 2.1 


Fail to reject H, 


Reject Hy 


11.143, ® 


The number of degrees of freedom is the number of classes 
minus 1: df= 5 — 1 = 4. With @ = .025, the critical y*-value is 
11.143, Since 1.430 < 11.143, there is not sufficient evidence to 
reject Hy. It should not be disputed that a Poisson with w= 2.1 
describes the distribution of survival years of patients stricken with 
this cancer. 


‘An editor ct ¢ number of typing errors per page on 12! 
Example 198 An editor checks the number of 5 
pages of a manuscript with the following results: 


‘Number of Errors 
0 1 2 3 4 3 
Observed 2 | 3 | w | wf] 3 | 3 


+ of pages 


Do these data follow a Poisson distribution? Test at the 10% 
significance level. 
Answer: In this case the mean is not given, so it must be estimated 


from the sample. The total number of errors is 200) + 38(1) + 
24(2) + 18(3) + 10(4) + (5) = 195. Thus the average per page 
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of sample is 195/125 = 1.56. The Poisson probabilities are as 
follows: 


PO) = e186 = 210 
PQ) = 1560-15 = 328 


PQ) = = 256 
PG) = = 133 
Pa) = = 052 


(210 + 328 + 256 + 133 + 052) = 021 


PG or more) = 


Mukiplying by 125 gives the expected number of pages with 
various numbers of mistakes: .210(125) = 26.25, .328(125) = 41, 


.256(125) = 32, 133125) = 16.625, .052(125) = 6.5, .021(125) = 
2.625.! 
Number of Errors: 
Ce ee eee eee? 
Expected 
corps — (225 | 41 | 32 [ 16625] 65 | 2025 
= 29 = 2625) , B= 49? , (24 - 3292 
Tees a ae 
4 GB 16625" , 03 ~ 65% , G ~ 2625» 
16.625 65 2625 
= 9175 


H,: good fit with a Poisson distribution. 


Since the sample is giving both the total number of pages (125) 
and the average errors per page (1.56), the number of degrees of 
freedom is the number of classes minus 2: df= 6 - 2 = 4, With 
@ = .10 the critical x-value is 7.779. Since 9.175 > 7.779, there is 
sufficient evidence at the 10% level to reject H, and to claim that 
the typing errors per page do not follow a Poisson distribution. 


2Note this does violate the previously mentioned rule of thumb that every cell must have an 
expected value of at least 5. Some statisticians accept one lower value; alternatively, we could 
combine the last rwo cells into one cell with 2 higher expected value. 
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GOODNESS-OF-FIT TEST FOR NORMAL 
DISTRIBUTION 


‘A purchasing agent weighs a sample of 225 bags of rice labeled 
as containing 50 pounds each with the following results. 


‘Weight (Pounds) 


Under 4925 49.75 50.25 Over 
49.25 4975 ~$0.25 ~50.75 50.75 


Observed 
‘number of bags 


=lal~[s» 


‘Test the null hypothesis that the data follow a normal distribution 
with p= 50 and o = 05. Assume a 2.5% and then a 1% 
significance level. 


Answer: The =scores for 50.25 and 50.75 are, respectively, 


BB Deas snd BB—Bais 


05 Os 


Similarly, 49.75 and 49.25 have zscores of ~0.5 and ~1.5. Using 
Table A, we find these values: 


800 
07 207 
“ee 008 
Ca a ee Pounds 
ats 25 os 1s sy 


Multiplying each probability by 225 gives the expected numbers 
of bags for the corresponding weight ranges: 


.0668(225) = 15.03, .2417(225) = 54.38, .3830(225) = 86.18. 
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‘Weight (pounds) 


Under 4925 49.75 50.25 Over 
4925 49.75 —5025 -50.75 50.75 


ibe ol bass 1503 | 54.38 | 86.18 | 5438 | 15.03 


503)? , (61 ~ 5438)? , (70 — 8618)" 

1503 3438 O18 
(59 ~ 5438)? , (10 ~ 1503) _ 

+8 tases 12533 


H,: good fit to a normal distribution with = 50 and o = 0.5 


Fail to reject H, 
Rojoct Hy 
11.443 e 
Fait to reject My 
Reject H, 
13277 # 
With df= 5 — 1 = 4 and a = .025, the critical x2-value is 11.143, 
while @ = 01 results in a critical value of 13.277. Since 12.533 > 


11.143 and 12.533 < 13.277, we have sufficient evidence to reject 
H, at the 2.5% significance level but not at the 1% level. Thus, if 
the purchasing agent is willing to accept a 2.5% a-risk, she should 
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conclude that the distribution of weights of rice bags is not 
normal. 


‘Suppose that the assembly times for a sample of 300 units of an 
electronic product have mean = 84, standard deviation o = 3, 
and the following distribution 


Assembly Time (minutes) 
<8 7881 BIBL BEET 87-90 290 


Observed 


oeorums LS 1] » | 7 | © | #@ | 35 


At the 1% significance level, test the null hypothesis that the 
distribution is normal. 


Answer: Here we must use the sample mean, 84, and the sample 
standard deviation, 3. Table A gives these values: 


Multiplying by 300 units yields the following expected numbers 
of units: 
Assembly Time (minutes) 
<8 7881 8184 8487 87-90 >90 


Expected 
seer of units LOS | 407 | 10239 | 102.39 | 40.77 | 66 
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5 — 684", G9— 4077", (87 ~ 10239) 
684 4077 
+ G8= 4077" 
4077 


Thus x? 


Since we are using three measures from the sample (size, mean, 
and standard deviation), the number of degrees of freedom equals 
the number of classes minus 3: df= 6 ~ 3 = 3. With a = .01, the 
critical x*-value is 11.34. Since 23.540 > 11.34, there is sufficient 
evidence to reject H, and to conclude that the distribution of 
assembly times is not normal. 


FITS THAT ARE TOO GOOD 
‘We have been using only small, upper-tail probabilities of the chi-square 


distribution, 


ad 


Trompe 1 


In some cases, however, We are interested in the probability that 


juare value is less than or equal to a critical value, 


Suppose your mathematics instructor gives, asa homework as- 
signment, the problem of testing the faimess of a certain die. You 
are asked to roll the die 6000 times and to note how often it comes 
up 1, 2, 3, 4, 5, and 6. Tossing the die soon becomes tiresome, 80 
you decide to invent some “reasonable” data, Being careful to 
make the total 6000, you write: 


Number on Die 
1 2 3 4 3 6 
Observed gys8 | 991 | 1010 | 990 | 1013 | 1008 


Hg: good fit to uniform distribution; that is, die is fair 


‘The “expected” values are all 1000, so 


a. What is the conclusion if a = .01? .05? 10? 


Answer: With df = 6 ~ 1 = 5, the critical x?-values are 15.086, 
11.070, and 9.236. Since 0.658 is less than each of these, there is 
‘not enough evidence to claim that the die is unfair. 
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'b, What is the conclusion if we're willing to accept a = .90? .95? 
I? 98 


Answer: These give critical x?-values of 1.610, 1.145, 0.831, and 
0.554, respectively, so only if we accept a 99% chance of commit- 
ting a Type I error can we reject H, and question the faimess of 
the die. Even if the die is fair, the probability of obtaining such a 
good-fiting sample is extremely small. Your instructor would 
reasonably conclude that you simply made up the data. Your 
results are too good to be true! 


Statisticians have shown that some of Gregor Mendel's experimental results 
are too good to be true. No one disputes his conclusions, but there are strong 
indications that the monks working for him falsified some data to conform with 
the intended conclusions. 


EXERCISES 


1. A college alumni office states that 20% of the graduates eventually become 
lawyers; 25%, doctors; and 35%, corporate executives. The remaining 20% 
are spread among a variety of professions. A new survey taken of 625 
graduates tumed up 110 lawyers, 140 doctors, 250 corporate executives, 
and 125 others. Is there sufficient evidence that the percentages quoted by 
the alumni office have changed? Test at the 5% significance level, 


2. A highway superintendent states that five bridges into a city are used in 
the ratio 2:3:3:4:6 during the morning rush hour. A highway study of a 
sample 6000 cars indicates that 720, 970, 1013, 1380, and 1917 cars use the 
five bridges, respectively. Can the superintendent's claim be rejected if a 
= 05? 025? 

3. A study (The Lancet, October 22, 1994, page 1137) of accident records at 


a large engineering company in England reported the following number 
Of injuries on each shift for 1 year: 


shit 
‘Morning Afternoon Night 
Number of injuries 1372 1578 1686 


Is there sufficient evidence to say that the numbers of accidents on the 
three shifts are not the same? Test with a = .05. 
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4. A building inspector inspects five major construction sites per day. In a 
sample 200 days, the numbers of sites passing inspection are as follows: 
Number of Sites Passing 

03004 5 

Frequency s [| «| ofa 


At a 5% significance level can we conclude that the distribution is de- 
scribed by a binomial with = = 8 

5. A sampling of 500 major corporations is examined to determine the 
number of women among the top three executives. Using a = .01, test the 
null hypothesis that the results tabulated below follow a binomial distribu- 
tion. Note that you must use the observed data to approximate the 
population proportion =. 


Number of Women 
‘Among Top 3 Executives 


° 1 2 3 


Number of corporations[ 245 | 170 | so | 35 


6 A real estate agent tabulates the number of home sales per week during a 
sample of 83 weeks. 


Sales per Week 
0 1 2 3 4° Sor more 
Frequency 3 [ 3 [wl a [e6 ls 


‘Test the null hypothesis that the distribution is Poisson with jt = 2.4. Use 
a= 05. 

7. A company efficiency expert records the number of wrong connections 
per hour by a new telephone switchboard operator. For a sample of 50 
hours, the results are as follows: 

Mistakes per Hour 
0 1 2 3 4 


Frequency 3 « | 7 | 8 4 
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Test the null hypothesis that the distribution is Poisson. Assume a 10% 
significance level, and note that you must use the observed data to 
approximate p. . 

8. Consider the following distribution of salaries of 100 junior level execu- 
tives: 


Salary ($1000) 
<40 4050 5060 60-70 >70 


Frequency w | 32 | 33 | 15 | 10 


‘Test the null hypothesis that the distribution is normal with a mean of 55 
and a standard deviation of 10. Use a = .025, 


Study Questions 
1, Consider three of our often used 2scores: 1.645, 1.96, and 2,576, Square 
these values, and look for the results in Table D, the chi-square distribution 
table, in the Appendix. Conclusion? 
2. Show that 


plobs— exp? _ slobs _ 
exp exp 


where Nis the total number of elements in the sample. (Hint: © exp = 
Lobs = M 
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CHI-SQUARE TEST FOR 
INDEPENDENCE 


In each of the goodness-of-fit problems in Chapter 19, a set of expectations 
was based on some assumption about how the distribution should turn out. 
We then tested whether an observed sample distribution might reasonably 
have come from a larger set based on the assumed distribution. 

In many real-world problems, however, we want to compare two or more 
observed samples without any prior assumptions about an expected distribu- 
tion, In what is called a test of independence, we ask whether the two or more 
samples might reasonably have come from some one larger set. For example, 
do students, professors, and administrators all have the same opinion concern- 
ing the need for a new science building? Do nonsmokers, light smokers, and 
heavy smokers all have the same likelihood of being eventually diagnosed with 
cancer, heart disease, or emphysema? 

‘We classify our observations in two ways, and then ask whether the 
‘two ways are independent of each other. For example, we might consider 
several age groups, and within each group ask how many employees show 
various levels of job satisfaction. The null hypothesis would be that age 
‘and job satisfaction are independent; that is, that the proportion of employees 
expressing a given level of job satisfaction is the same no matter which 
age group is considered. A sociologist might classify people by ethnic ori- 
gin, and within each group ask how many individuals complete various 
levels of education. The null hypothesis would be that ethnic origin and 
education level are independent; that is, that the proportion of people achiev- 
ing given education level is the same no matter which ethnic group is 
considered. 

Our analysis will involve calculating a table of expected values, assuming 
the null hypothesis about independence is true. We then compare these 
expected values with the observed values, and ask whether the differences 
are reasonable if H,, is true. The significance of the differences is gauged 
by the same x-value of weighted squared differences used in Chapter 19. 
‘The smaller the resulting y#-value, the more reasonable is the null hypothesis 
of independence. If the y@-value exceeds some critical value, we can say 
that the evidence is sufficient to reject the null hypothesis and to claim 
that there i some relationship between the two variables or methods of 
classification. 
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Example 20.1 A beef distributor wishes to determine whether there is a relation- 

~~ ship between geographic region and cut of meat preferred. If 
there is no relationship, we will say that beef preference is 
independent of geographic region. Suppose that in a random 
sample of 500 consumers, 300 are from the North while 200 are 
from the South. Of these 500 persons, 150 prefer cut A, 275 prefer 
cut B, and only 75 prefer cut C 


Geographic Region 
Nonth South 
Cut A 150 
Beet =—— >= 4 
preference Out B 275 
cue 75 
300 200 


a. If beef preference is independent of geographic region, how 
would we expect this table to be filled in? 


Answer: Since 300/500 = .6 of the sample is from the North, we 
would expect .6 of the 150 consumers favoring cut A to be from 
the North. We calculate .6(150) = 90 or (300X150)/500 = 90. 
Similarly, we would expect .6 of the 275 consumers favoring cut 
B to be from the North. We calculate .6(275) = 165 or 
(300X275)/500 = 165. Continuing in this manner we fill in the 


table as follows: 
North South 
cut A % 60 150 
peed Cut B 165 0 25 
cue 45 30 5 
300 200 


Note that we didn't have to perform all the calculations to fill in 
the table. After determining 90 and 165, no choices were left for 
the remaining values because the row totals and column totals 
were already set. Thus, in this problem there are two degrees of 
freedom, that is, df= 2 
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b. Now suppose that in the actual sample of 500 consumers, the 
observed numbers were as follows: 


North South 
Cut a 100 50 150 
Observed cut B 150 125 2s 
cut © 50 25 ms 

300 200 


Are the differences between the expected and observed values 
lasge or small? If the differences are large enough, we will reject 
independence and claim that beef preference is related to geo- 
graphic location. 


Answer: We calculate the ? value: 


a a yobs = exp? 
* exp 
= G00 = 90) , (50 = 60 , 150 ~ 165)? | (125 - 1102 
30 0 165 710 
(50 = 45 , 5 ~ 307 _ 
+e + SF 7578 


Is 7.578 large enough for us to reject the null hypothesis of 
independence? 


Fail to reject H, 


Reject Hi, 


9210 # 


Looking at the x? table, with df = 2, we see that, with a = .01, 
the critical x?-value is 9.210. Since 7.578 < 9.210, there is not 
enough evidence to reject H,- 
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Fail to reject H 


7378 xe 


However, with a = .025, the critical x?-value is 7.378; and since 
7.578 > 7.378, there és sufficient evidence to reject H,. Thus, we 
can conclude that at the 2.5% significance level there is a relation- 
ship between beef preference and geographic region, but at the 
1% significance level the evidence is not sufficient to reject the 
null hypothesis of independence. 


CONTINGENCY TABLES 


Example 20.2 In a nationwide telephone poll of 1000 adults, representing Demo- 

———— crats, Republicans, and Independents, respondents were asked if 
their confidence in the U.S. banking system had been shaken by 
the Savings and Loan crisis. The answers, cross-classified by party 
affiliation, are given in the following contingency table. 


Observed: 
Yes No___No opinion 
oom [ms | m | 
Republicans 150 165 | 35 
Independents 78 105 | 2» 


Test the null hypothesis that shaken confidence in the banking 
system is independent of party affiliation. Use a 10% significance 
level. 


Answer: The above table gives the observed results. To determine 
the expected values, we must first determine the row and column 
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totals, which were given in the Example 20.1. We obtain the 
following values: 


Row totals: 175 + 220 + 55 = 450, 150 + 165 + 35 = 350, 
75 + 105 + 20 = 200 

Column totals: 175 + 150 + 75 = 400, 220 + 165 + 105 = 490, 
55+ 35+ 20= 110 


Yes No___Noopinion 

Democrats | 450 

Republicans | 350 

Independents | 200 
00 BO 110 


To calculate, for example, the expected value in the upper left 
box, we can proceed in any of several equivalent ways. First, we 
could note that the proportion of Democrats is 450/1000 = .45; 
and so, if independent, the expected number of Democrat yes 
responses is .45(400) = 180. Instead, we could note that the 
Proportion of yes responses is 400/100 = .4; and so, if inde- 
pendent, the expected number of Democrat yes responses is 
4(450) = 180. Finally, we could note that both these calculations 
simply involve (450400)/1000 = 180. 

In other words, the expected value of any bax can be calcu- 
lated by multiplying the corresponding row total times the appro- 
priate column total and then dividing by the grand total. 
‘Thus, for example, the expected value for the middle box, which 
corresponds to Republican no responses, is (350X490)/1000 
= 1715. 

Continuing in this manner, we fill in the table as follows: 


ace 
Ss 

Democrats | 180 220.5 49.5 450 

OE (eras ea ee 


s]8 
8 


0 
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772 non-heart-related admissions, 175 had vertex baldness. Is this evidence 
sufficient at the 5% significance level to say that there is a relationship 
between heart attacks and vertex baldness? 

. Following are the results of a study (New England Journal of Medicine, 
1988) to determine whether taking an aspirin every other day reduces the 
risk of heart attacks in men: 


Aspirin Placebo 
Fatal attack 5 18 
Nonfatal attack » m 
No heart attack 10.933 10,845 


Is there evidence of a relationship between taking aspirins and the risk of 
heart attacks? Test at the 5% significance level 
. A4 year study, reported in The New York Times, of men more than 70 years 
old analyzed blood cholesterol, and noted how many men with different 
cholesterol levels suffered nonfatal or fatal heart attacks. 


low Medium High 
Cholesterol_Cholesterol_Cholesterol 


Nonfatal hear attacks | 29 7 18 


Fatal heart attacks 19 20 9 


Is there evidence of a relationship? Does level of cholesterol seem to 
predict risk of heart disease for men over 70 years old? Test at the 10% 
significance level, 

|. Does sea bathing lead to illness? For a sampling of 1112 bathers in four 
English resort areas (The Lancet, October 1, 1994, page 906), the following 
table shows exposure to varying amounts of fecal material in the water 
versus number of cases of gastroenteritis reported. 


Fecal Streptococci per 100 ml 
None 0-34___—35-69___over 70, 


Cases of gastroenteritis 7 4 “4 ” 


No related illness 548 m3 125 ca 


Is there evidence of a relationship between amount of fecal material in the 
‘water and reported gastroenteritis illness? Test at the 1% significance level. 
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5. Is hair-loss pattem related to body mass index (kg/m?)? One study (Journal 
of the American Medical Association, February 24, 1993, page 1000) of 769 
men showed the following numbers: 


Hair-Loss Pater 
None Frontal Vertex 
<2 | 137 2 40 
Body-mass index 25-28 | 218 Ey 0 
> | 153 0 68 


At the 596 significance level is there a relationship between hair-loss pattern 
and body-mass index? 

6. Is there a relationship between fitness level and smoking habits? A study 
(New England Journal of Medicine, February 25, 1993, page 535) cross- 
classified individuals in four categories of fitness and four categories of 
smoking habits: 


Fitness Level 

Low Medium High 

Never smoked 13 3 no 159 
Former smokers 19 135 12 190 
1 10 9 cigarettes daily 7 a 86 65 
10 cigarettes daily 181 12 14 3 


Is there a relationship between fitness level and smoking habits? Test at 
the 2.5% significance level. 


Study Questions 
1, Suppose that in a telephone survey of 800 registered voters, the data are 
cross-classified both by sex of respondent and by respondent's opinion on 
an environmental bond issue. 
Bond Issue 
For Against 
Men 450 150 


Women 160 40 
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a. Determine the x? value for these data. 
b. Suppose that instead we use the approach given in Chapter 18 to test 
the significance of the difference between the proportions 
By = 450/600 = .75 and J, = 160/200 = 8 
Let B = (450+160)/800 = .7625 and calculate 


=| 7625K23754, +3 ) 


¢. Find the observed z-value which equals (.75 ~ 8/04 
d. Show that the square of this z-value is precisely equal to the x? value 
determined in part a 

It can be shown more generally that the x? value for a two-by-two table 
is equal to the squared observed 2-value for the difference in proportions, 
In this case we can determine the p-value for a x? value by taking the 
square root of the x? value and then using the normal distribution table. 

. A survey of college students focuses on a cross-classification by year in 
school and positive or negative feelings about the college experience, 


Year in School 

Freshman Sophomore _Junior___ Senior 

Positive 105 100 % Bs 
Negative 35 0 30 25 


a. Determine the value of x2. 
b. There are three degrees of freedom in this example. Suppose the table 
is split into the following three smaller tables, each having a single 
degree of freedom: 


Fresh. Soph. Jun. Sen. Undercl. Uppercl 
Pos. | 105 | 100 Pos | 90 | 125 Pos.| 205 | 215 
Neg.| 35 | 50 Neg. | 30 | 25 Neg} 85 | 55 


Determine the three x? values, one from each of these tables. 

¢. Show that the x? value of the original table is equal to the sum of the 
three x? values calculated. This illustrates the additive property of the 
X¢ statistic. 


Part 7 


REGRESSION 


Many decisions are based on a perceived relationship between 
two variables. For example, a company's market share may vary 
directly with advertising expenditures. A person's blood pressure 
‘may increase or decrease inversely as less or more bypertension 
‘medication is taken. A student's grades will probably go up and 
down according to the number of hours of studying time per week, 

Two questions arise. First, bow can the strength of an apparent 
relationship be measured? Second, bow can an observed relation- 
ship be put into functional terms? For example, a real estate broker 
not only might wish to determine whether a relationship exists 
between the prime rate and the number of new homes sold in a 
month, but also would find useful an expression with which to 
predict the number of house sales given a particular value of the 
prime rate. 

Chapters 21 and 22 deal with the related topics of scatter dia- 
grams, linear regression, predictions using the regression line, 
correlation coefficients, and hypothesis tests for correlation. 
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LINEAR REGRESSION 


Our studies so far have been concerned with measurements of a single 
variable such as the mean or proportion. However, many important applica~ 
tions of statistics involve examining whether two or more variables are related 
to one another. For example, is there a relationship between the smoking 
histories of pregnant women and the birth weights of their children? Between 
SAT scores and success in college? Between amount of fertilizer used and 
amount of crop harvested? Between price and numbers of bedrooms and 
bathrooms in new homes? 


SCATTER DIAGRAMS 


Suppose a relationship is perceived between two variables X and Y, and we 
graph the pairs (xy). The result, called a scatter diagram, gives a visual 
impression of the existing relationship between the variables. In this chapter 
we ask whether the relationship can be reasonably explained in terms of a 
linear function, that is, one whose graph is a straight tine. 

For example, we might be looking at a picture such as this: 
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Trample 211 The ages and salaries (in thousands of dollars) of four executives 
in a business firm are as follows: 


Ag: 385347 
Salary: 45 86 S58 OL 


Plotting the four points (38,45), (53,86), (42,58), and (47,61) gives 
the following scatter diagram: 


(63.86) * 


+ (47,61) 
(42,58) + 


(38,45) + 


‘What is the best fitting straight line that can be drawn above? 


Answer: On the basis of our experiences with measuring vari- 
ances, by best fitting we will mean the straight line that minimizes 
the sum of the squares of the differences between the observed 
values and the values predicted by the line. 
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Using the expression y,= + m{x,-¥, from the preceding section, and, 
again, the fact that am? + bm + c has its lowest point when m = —b/2a, we 
can show algebraically: 


Lay sat mdg tet xe and Latest tad t.tad 
‘Then the equation of the regression line, as before, is 
¥ =F + mx-*) 


INTERPRETATION OF SLOPE AND 
PREDICTIONS USING REGRESSION LINE 


Example 21,2 An insurance company conducts a survey of 15 of its life insurance 

——~——~_ agents. The average number of minutes spent with each potential 
customer and the number of policies sold in a week are noted for 
each agent. Letting X and Y represent the average number of 
minutes and the number of sales, respectively, we have: 


X 25 23 30 25 20 33 18 21 22 30 2% 2% 27 2 20 
¥ 10 11 14 12 8 18 9 10 10 15 11:15 12 14 


Find the equation of the bestfitting straight line for the data 


Answer: Plotting the 15 points (25,10), (23,11), .. ., (20,11) gives 
an intuitive visual impression of the relationship: 


of policies 


Minutes per potential customer 
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This scatter diagram indicates the existence of a relationship that 
appears to be linear; that is, the points lie roughly on a straight 
line. Furthermore, the linear relationship is positive; that is, as one 
variable increases, so does the other (the straight line slopes 
upward). 

To determine the equation of the bestfitting line, we first 
calculate: 


Ex = 25 +23 + 30+...+ 20 = 375 

By = 10+ +14 +... +11 = 180 

Ex? = 252 + 22 + 308 +...+ 202 = 9639 

‘Zay = 25(10) + 2K11) + 3014) +... + 20011) = 4645 


Then n= 15, s0 
gett .3Bn ee 
ya Baw, 5e te Ben 
and 
= 29-8) =16. 
ie aa) 264 ~ 05492 


The regression line is given by 
yf =F + mx — ) = 12 + 0549%x — 25) = 05492x ~ 173 


5 ‘Minutes per potential customer 


Thus, for example, we might predict that agents who average 24 minutes 
per customer will average 0.5492(24) — 1.73 = 11.45 sales per week. We also 
note that each additional minute spent seems to produce an average 0.5492 
extra sale. 


254 — Regression 
Example 213 Following are advertising expenditures and total sales with regard 
——~—— to six detergent products: 


Advertsing ($1000): x 23 5.7 48 73 59 62 
Total sales ($1000): y 77 105 96 118 102 95 


a. Find the equation of the regression line. 


Answer: We calculate Ex = 32.2, By = 593, Zxy = 3288.6, and 
‘Ex? = 187.36 which give ¥ = 5.367, j = 98.833, and 


‘Thus 
Y = 98.833 + 7.293x — 5.367) = 7.293x + 59.691 
b. Predict the total sales if $5000 is spent on advertising. 


Answer: Using the equation of the regression line, the resulting, 
total sales will be 7.293(5) + 59.691 = 96.156 thousands of dollars 
($96,156) 


. Interpret the slope 


Answer: The slope of the regression line indicates that every extra 
$1000 spent on advertising will result in $7293 of added sales. 


EXERCISES 


1. AS reported in The New York Times (September 21, 1994, page C10), a 
study at the University of Toronto determined that, for every 10 grams of 
saturated fat consumed per day, a woman's risk of ovarian cancer rises 
20%. What is the slope of the appropriate regression line, and what does 
it mean 


2, Jay Bennett (Chance, Winter 1995, page 38) calculated the regression line 
for average 1991 SAT scores (total math plus verbal) versus number of 
dollars spent per student in 1991 for New Jersey school districts and 
obtained a slope of 0.0227 and a intercept of 707. 

a. Find the equation of the regression line. 

b. Interpret this slope in terms of each additional $1000 spent per student, 

c. What average SAT result does this regression line predict for students 
in a district that spends $10,000 per student? 

d. According to this analysis, how much should a district spend per 
student in order for its students to average 1000 on the SAT exam? 
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3, According to The New York Times (April 2, 1993, page AI), the average 
monthly rate for basic television cable service has increased as follows: 


‘Year: 1986 198719881989 1990 1991-1992 
Rate: $11.00 1320 1390 1520 1680 18.00 20.00 


a. Find the equation of the best-fitting straight line. 
b. Interpret the slope. 
¢. Predict the average monthly rate in 1993 (actually it was $21.00). 
d., Predict in what year the rate will reach $50.00. 

4, The heart disease death rates per 100,000 people in the United States for 
certain years, as reported by the National Center for Health Statistics, were 
as follows: 


Year: 1950 1960 197019751980 
Death rate: 307.6 2862 2536 2178 2020 


a. Find the equation of the bestfining straight line. 
b. Interpret the slope. 
©. Predict the death rate in 1983 (actually it was 188.5). 
. Predict in what year the death rate will drop to 100. 

5. In studying the “ozone-deficit” problem (Science, September 23, 1994, 
Page 1835), one experiment measured laser-induced fluorescence (LIF) at 
various ozone partial pressures, obtaining the following results: 


‘Ozone partial 
pressure (mor): 8 16 27 41 5267 BS 
UF intensity: 15 3 42 75° «98 SS 


a. Find the equation of the best-fitting straight line. 

b. Interpret the slope, 

. Predict the LIF intensity if the ozone partial pressure is 37 millitorr. 

. Predict what ozone partial pressure would result if the LIF intensity 
was 18. 


Study Question 
Suppose we wish to minimize 
LO — YF = Oy — HF + On HF t.+ On 
where yj = 5 + m(x, —%) 


We consider 
LE — (F + mx — DP = Ly — F- mx — DP 
= Ly — FP — Wm Ly - FXx - ¥) 
+ mi? L(x — RP 
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Using the fact that am? + bm + c has its lowest point when m = ~b/2a, 
show that we want 
= 20-P2-*» 
™ Le = 3 
Expand to 
= Ly - Ej - EH + ESD 
Lx —2Laet+ Le 
Now, using the fact that ¥ and } do not vary and so can be factored out of 
summations (e.g., 2x7 = FE, and the fact that Lx = nF and Ly = rij, show 
that the above expression for m can be algebraically reduced to 
Poe ane 
Dx — ny 
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CORRELATION 


‘There are ways to gauge whether the perceived relationship between 
variables is strong enough so that finding the regression line and making use 
of it are meaningful. Although a scatter diagram will usually give an intuitive 
visual indication when a linear relationship is strong, in most cases it is quite 
difficult to visually judge the specific strength of a relationship. For this reason 
we develop a mathematical measure called the correlation. Important as 
correlation is, we need always to keep in mind that significant correlation does 
not necessarily indicate causation. 


J. Consider the data and results from Example 21.1. How good a fit 
is y/ = 2.548x ~ 52.16? For example, compare the actual values 
to the values predicted by the regression formula. Plugging in the 
various values of x gives 2.54838) — 52.16 = 44,664, 2,548(53) 
= 52.16 = 82.884, 254842) — 52.16 = 54.856, 2.548(47) = 
67.596. 


x y y 


38 45 44.664 
53 86 82.884 
2 58 54.856 
7 a 67.596 


As tabulated above, the observed j-values and regression y/-values appear 
to be close. However, are they close enough to be significant? 


CORRELATION COEFFICIENT 


‘One measure of an apparent relationship, such as the one above, is called 
the correlation coefficient and is denoted as r. Actually, r is related to the 
Fratio studied in Chapter 15. Like the Fratio, r is the square root of a ratio of 
variances. 


x ya PF wa 


m 


In other words, r°, called the coefficient of determination, is the ratio of the | 
variance of the predicted values, y’, to the variance of the observed values, 9. 
Alternatively, we can say that there is a partition of the variance, and ris 
the proportion of this variance that is predictable from a knowledge of x. 

Continuing Example 22.1, we calculate the variances as follows. From an 
earlier calculation we have ) = 62.5, and so 


pe = (NGG ~ 62592 + (82884 ~ 625) + (54856 ~ 625% + (67596 ~ 625) 
3 3 
= 21893 — 204508 and 


= 15 = 625) + (86 ~ 625) + 68 ~ 6257 + G1 - 625" _ 


220.25 


‘Thus 


and r= .9636 
It can be shown algebraically that 


‘The correlation coefficient rwill take the same sign as m (the slope of the re- 
gression line). The value of ralways falls between ~1and +1, with —1 indicat- 
ing perfect negative correlation, and +1 indicating perfect positive correlation, 


I rat ret 


Tacoma Tedomw-t a) 


HYPOTHESIS TEST FOR CORRELATION 


22.2 Consider Example 21.2. How good a fit is yf = 0.5492" — 1.737 
Plugging in the various values of «to compare the actual values 
to the values predicted by the regression formula gives these 
results 


x y y 
101200 
1 1090 
4475 
2 1200 

8 925 
1K 1639 
9 816 
0 9.80 
01035 
475 
no 1Rss 
1% 1255 
2 1310 
1420 
n 9.25 


Are the observed y values and the regression values close 
enough to be significant?’ We noted earlier that 1 = 15, Ex = 375, 
X= 25, Sy = 180, 5 = 12, Ex? = 9639, and Exy = 4645. We now 
calculate 


Bye = 10? +18 +142 +... 4 12 = 2262, 


and thus 
= (4645 = 1502512) = 04s 

"= BEI — 1SGSPX2R — THAD) ~ Boiraoy ~ 788 

and r= 8836. 


Is r = .9636 with four data pairs significant? How about .8836 
with 15 data pairs? If there really is no correlation, what are the 
chances that the rvalues could have been this lange? As expected, 
given our earlier hypothesis testing techniques, these answers 
depend on the level of significance with which we are working, 
Table E in the Appendix lists of critical rvalues for both the 5% 
and 1% levels of significance; that is, with no correlation, there are 
(05 and .01 probabilities of obtaining the given critical levels. Thus, 
as before, we talk about a-risks of .05 and .01, respectively. Note 
that to use Table E, the number of degrees of freedom must be 
known—in this situation df= n — 2. 

We have 


H,: no correlation 
H,: correlation 


If our calculated r, in absolute value, is greater than the critical 
‘we reject H, and say that at the given significance level there is 
sufficient evidence that r, the correlation coefficient, is significant. 

For Example 22.1, df= 4 — 2 = 2, which gives critical values 
of r of .950 and 990 at the 5% and 1% significance levels, 
respectively; .9636 > .950, but 9636 < .990. Thus, if we are 
willing to accept a 5% probability of committing a Type I error, 
we can conclude that there is sufficient evidence to indicate 
correlation. However, if we are willing to accept only a 1% 
probability of committing a Type I error, there is not sufficient 
evidence to reject the null hypothesis. 

For Example 22.2, df= 15 ~ 2 = 13, which gives critical values 
of .514 and .641 at the 5% and 1% significance levels, respectively; 
8836 > 514, and .8836 > .641. Therefore, at either of these 
significance levels, we conclude that there 4 sufficient evidence 
to indicate correlation. 


Following are the lengths and grades of ten research papers for a 


————_ sociology professor's class. 


Length (pages: x 25 32 20 28 15 4 2 30 45 35 
Grade: y ® 81 72 75 6 8 BI 73 92 96 


a. Find the equation of the regression line. 


Answer: We calculate as follows: 
Ex = 25+32+...435 = 23 
Ey = + 81+...+ 86 = 785 
Exy = 2569) + 32181) +... + 3586) = 23619 
Ex? = 25 + 322 +... + 35% = 9205 
yt = 652 + BP +...+ 86 = 62393 


¥ = 23 = 93, 5 = Be 
B= Fp = 293, 5 = Fy = 785 


23619 ~ 10(293)785) _ 6185 _ 
9205 ~ 10(29.3? 6201 0977 


¥ = 78S + 0997(x — 293) = 0997x + 493 
b. Plot a scatter diagram and graph the regression line. 
Answer: 


bal Length (pages) 


c. Use the equation to predict the grade for a student who turns 
in a paper 40 pages long. 

Answer: The grade for a student who turns in a paper 40 pages 

long is predicted to be 0.997(40) + 49.3 = 89.2. 

d. What is the slope of the regression fine, and what does it 
signify? 

Answer: The slope of the regression line indicates that, for each 

additional page, students can increase their grades by 0.997, that 

is, approximately 1 point per page. 
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. Test for correlation at both the 5% and 1% levels of significance. 
Answer: We calculate as follows: 


. e185 . 

C2ODEIN3 — TTBS) 
andr = 895 
With df= 10 ~ 2 = 8, the critical values are .632 and .765 at the 
3% and 1% significance levels, respectively; 895 > .632 and .895 
> .765. Therefore, at either of these significance levels we con- 
Glude that there is sufficient evidence to indicate correlation 
between a student’s grade and the number of pages in the 
research paper. 


r 801 


‘The shoe sizes and the number of ties owned by ten corporate 
vice presidents are as follows. 

Shoe size: z895 92 9 95 839 9 9S 
Numberofties y 10 10 8 15 12 13 16 7 12 4 
a. Draw a scatter diagram for this data 

Answer: 


8 . 


wpe 2 


b, Test for correlation with a = .05. 


Answer: We calculate as follows: 


Be = 8495 4...495 =92 
By =10+104...44 =107 
Say = 80) + 95410) +... + 95(4) = 9885 
Bet = 8 4958+. 4958 = 852 
By? = 102 +102 +. = 1.267 
¥= 292,53 == 107 


(9885 ~ 10692107) 
(852 = 10092)? 1267 — 100077) 


r= 1568 
With df = 10 - 2 = 8 the critical r value at the 5% level of 


significance is ,632. Since .1568 < .632, we conclude there is not 
evidence of correlation, 


a= 


= 02458 


¢. Can we use the techniques of this chapter to find the best fiting 
straight line approximation to the above data? Does it make 
sense fo use this equation to predict the number of ties owned 
by a corporate executive who wears size 10 shoes? 


Answer: Aithough theoretically we can use our techniques to find 
the best fitting straight line approximation, the result will be 
meaningless and should not be used for predictions. 


EXERCISES 


1. As reported in the Journal of the American Medical Association (june 13, 
1990, page 3031), for a study of 10 nonagenarians (subjects were age 90 
+ 1), the following tabulation shows a measure of strength (heaviest 
weight subject could lift using knee extensors) versus a measure of 
functional mobility (time taken to walk 6 meters). 


Suengh(k: 75 6 MS 105 95 18 4 12 9 3 
Wilktime@: 18 46 8 23 2% 7 2 12 0 4 


a. Find the equation of the regression line. 
b. Find the correlation coefficient, and test for correlation at the 5% 
significance level. 
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¢. What is the sign of the slope of the regression line, and what does it 
signify? 
2. The following table (Statistical Abstract of the United States) gives the US. 
population (in millions) in various years of the twentieth century. 


1900 76.1 19590 1519 
1905-838. 1955 165.1 
1910924 1960 1800 
1915 1005 1965 1935 
1920 1065 1970 2040 
1925158. 1975-2155 
1930 123.1 1980 227.2 
19351273 19852379 
1401325 1990 249.4 
IMS 1334 


Find the equation of the regression line. 
Find the correlation coefficient, and test for correlation at the 1% 
significance level. 
c. Predict what the population will be in the year 2100. 
d. What is the slope of the regression line, and what does it signify? 
€. Predict when the population will reach 300 million. 
3. In modeling population growth, an exponential function often gives a 
better correlation than a linear function. 
a, Using the data from Exercise 2, make a table of years since 1900 versus 
the natural log of the above population values. 
bb, Find the equation of the regression line of In y in terms of x (and 
calculate the correlation coefficient), and then soive for y in terms of x. 
Your result will have the form 


ya yee 
where y is the population (in millions) at a time x years since 1900, 
. Predict what the population will be in the year 2100. 
d. Predict when the population will reach 300 million 


4, The following table (Statistical Abstract of the United States) gives personal 
income in various years: 


oP 


1935 $467 1965 $2,845 
1940 586 1970 4,052 
1945 1,215, 1975 6,053 
1950 1,502 1980 9.948 
1955 1,903 1985 14,170 


1960 2,264 1990 18,699 


Find the equation of the regression line. 
. Find the correlation coefficient, and test for correlation at the 5% 
significance level 
. Predict what the personal income will be in the year 2050. 
4. What is the slope of the regression line, and what does it signify? 
€. Predict when personal income will reach $25,000. 
5. a. Using the data from Exercise 4, make a table of years since 1935 versus 
the natural log of the above personal income values. 
b. Find the equation of the regression line of In y in terms of x, and 
calculate the correlation coefficient. 
. Solve for y in terms of x. 
d. Predict what the personal income will be in the year 2050. 
€. Predict when personal income will reach $25,000. 


Study Questions 
1, Find an example for which r = 0. Plot the scatter diagram. 
2. a. Given that 


oe 


and that y= 3+ m(x ~ 3, show that 
P = 
an MUG — De 
‘ 9 =F 
b. Now, using from the Study Question in Chapter 21 the equation 
_ LQ-he-¥ 
a 


a 2 = LEO = Xx - BP 
—— Las L9-7F 
c. Using expansion and algebraic techniques derive that 


Cry = ny? 


"ox = na — OF 


3. An alternative approach is to standardize all scores, that is, to replace x by 
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where 


and to replace y by 


=, 2222 
¢, 
where 
o,= ey 


If there were complete positive linear dependence of yon x, then, for each 
standardized change in x, there would be an identical standardized change 


in y; thats, z, would equal 2, forall and y, and = would equal 1. More 
generally if 


then the correlation coefficient, r, can be defined as the slope r= =. 


a. Starting with zy = rz, use the properties of variances and the fact that 
63, = 1 to show that 


ear 


b. Staning with z=“, of ¥ = 0,2 +5, use the properties of 
variances to show that”? 


= (oot, 


« Combine the results of parts @ and b and conclude that +? = 

7 
In this chapter we defined the square of the correlation coefficient to be a 
ratio of the variance = OP” to the variance SUI” there, By ~F* 
is called the explained variation, while 33 — is the total variation. 


In Example 22.1 we found that the explained variation (y/ ~ j)* = 818, 
while the total variation 3(.y ~ 9)? = 881 


a. Calculate X(y— y/, called the unexplained variation, and show that 
it is equal to the (otal variation, 881, minus the explained variation, 818. 

, Calculate the explained variation, the unexplained variation, and the 
total variation in Example 22.2. 


270 Random Somples 


In the real world, time and cost considerations usually make it impossible 
to analyze an entire population. Does the government question you and your 
parents before announcing the monthly unemployment rates? Does a producer 
check every household's viewing preferences before deciding whether a pilot 
program will continue? In studying statistics we learn how to estimate popula- 
tion characteristics by considering a sample. For example, in this book we 
estimate population means and proportions by looking at sample means and 
proportions, 

To derive conclusions about the larger population, we need to be confident 
that the sample we have chosen fairly represents that population. Analyzing 
the data with computers is often easier then gathering the data, but the 
frequently quoted “garbage in, garbage out” applies here, Nothing can help if 
the data are badly collected. Unfortunately, many of the statistics with which 
we are bombarded through newspapers, radio, and television are based on 
poorly designed data-collection procedures. 

‘Two common techniques that often result in flawed conclusions are volun- 
tary response samples and convenience samples. Voluntary response samples, 
based on individuals who offer to participate, typically give too much emphasis 
to persons with strong opinions. For example, radio call-in programs about 
controversial topics such as gun control, abortion, and school segregation do 
not produce meaningful data on what proportion of the population favor or 
oppose related issues. Convenience samples, based on choosing individuals 
Who are easy to reach, are also suspect. For example, interviews at shopping 
malls tend to produce data highly unrepresentative of the entire population, 

Poorly designed sampling techniques result in bias, that is, in a tendency to 
favor the selection of certain members of a population. An often-cited example 
of selection bias is the Literary Digest opinion poll that predicted a landslide 
victory for Alfred Landon over Franklin D. Roosevelt in the 1936 presidential 
election. The Digest surveyed people with cars and telephones, but in 1936 
only the wealthy minority, who mainly voted Republican, had cars and 
telephones. Examples of nonresponse bias are most mailed questionnaires. 
They tend to have very low response percentages, and it is often unclear which 
part of the population is responding. Unintentional bias often creeps in when 
the surveyor tries to systematically pick people representative of the whole 
population. For example, in 1948 the Chicago Tribune incorrectly called 
Dewey the winner over Truman. Here the mistake was partly due to misleading 
polls based on quota sampling which left the interviewers (00 much free 
choice in picking people to fill their quotas. 

How can a good, that is, a representative, sample be chosen? The most 
accurate technique would be to write the name of each member of the 
population on a card, mix the cards thoroughly in a large box, and then pull 
‘out some specified number of these cards. This method would give everyone 
in the population an equal chance of being selected as part of the sample. 
Unfortunately, this method is usually too time consuming and too costly, and 
bias might still creep in if the mixing was not thorough. A simple random 
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sample can more accurately be obtained by assigning a number to everyone 
in the population and then having a computer generate random numbers to 
indicate choices. 

Time and cost-saving modifications are needed to implement sampling 
procedures. Systematic sampling involves randomly listing the population in 
order, and then picking every tenth, hundredth, or thousandth, etc., person 
from the list. In stratified sampling the population is divided into representative 
‘groups called strata, and random samples of persons from all strata are chosen, 
‘Multistage sampling involves dividing the population into groupings, then each 
‘grouping is subdivided, a random sampling of the subdivisions is selected, and 
finally a random sample of people is picked from the selected subdivisions. 
‘What is crucial in all these techniques is the presence of a systematic procedure 
that involves the use of chance and leaves no freedom of choice to the 
interviewers, 

In statistical studies that aim to confirm relationships between variables, we 
must be cognizant of the difference between observational versus controlled 
studies. In observational studies there is no choice in regard to who goes into 
which group. For example, a researcher cannot ethically tell 100 people to 
smoke three packs of cigarettes a day and 100 others to smoke only one pack 
per day; he can only observe people who habitually smoke those amounts. In 
a controlled study, however, the researcher can randomly divide subjects into 
appropriate groups. For example, patients may be randomly given unmarked 
capsules of either aspirin or acetaminophen and then the effects measured. 
Controlled studies often have a treatment group and a control group, in the 
ideal situation, neither the subjects nor their attendants know which group is 
which. The Salk-vaccine experiment of the 1950s, in which half the children 
received the vaccine and half were given a placebo, with not even their 
doctors knowing who received what, is a classic example of this double-blind 
approach. 


APPENDICES 


x | oo =. 2 sts ta 
0.0 | .0000 0040 0080 0120 0160 0199 0239 0279 0319 0359 
0.1 | 0398 0438 0478 05170557 0596 0636 0675 0714. 0753 
0.2 | 0793 0852 0871 0910 098.0987 10261064 1103. 1141 
03 | 1179 1217 125512931331 .1368 1406-1443 14801517 
OA | 1554 15911628 1664 1700-1736 1772-1808 1844-1879. 
05 | 1915 1950 1985 2019-2054 2088-2123 .2157 2190 2224 
06 | 2257 2201 2324 2357 2389 2422-2454 248625172549. 
0.7 | 2580 2611 2642 2673 270427342764 2794 28232852 
0.8 | 2881 2910 29392967 29953023 3051 30783106 3133 
0.9 | 3159 3186 32123238 3264 32893315 334033653389 
1.0 | 3413 34383461 34853508 353135543877 35993621 
1.1 | 3643 3665 3686 3708 3729 374937703790 3810 3830 
1.2 | 3849 3869 38883907 392539443962 398039974015, 
13 | 4032 4049 4066 4082 4099 4115-4131. -4147 4162417 
14 | 4192 4207 4222 42364251 426842794292 43064319 
15 | 4332 434543574370 438243944406 4418 44294441 
16 | 4452 4463 A474 4484 4495-4505 45154525 45354545, 
1.7 | 4554 4564 4573. 4582 4591 4599 4608 4616 46254633 
18 | 4681 4649 4656 4664 4671 4678 4686 4693 4699-4706 
19 | 4713 4719 4726 473247384744 475047564761 4767 
20] 4772 4778 4783 478847934798 4803-4808 48124817 
21 | 4821 4826 4830 4834 483848424846 485048544857 
2.2 | 4861 4864 4868 48714875 4878 488148844887 4890 
23 | 4893 4896 4998 4901 4904 4906 4909 4911 4913 4916 
24 | 4918 4920 4922 4925 4927 4929 4931 4932 4934 4936 
25 | 4938 4940 4941 4943 4945 4946 4948 4949 4951 4952 
26 | 4953 4955 4956 4957 4959 4960 4961 4962 4963 4964 
27 | 4965 4966 4967 4968 4969 4970 4971 4972-4973. 4974 
28 | 4974 4975 4976 4977 4977 4978 4979 4979 4980 4981 
2.9 | 4981 4982 4982 4983 4984 4984 4985 4985 4986 4986 
3.0 | 4987 4987 4987 4988 4988 4989 4989 4989 4990 4990 


Abridged from Table Tof A. Hald, Statistical Tables and Formulas (New York: John Wiley & Sons, 
Ine.) 1952. Reproduced by permission of A. Hald and the oublisher. lohn Wiley & Sons. Inc. 


See i 
“ey Oe 2 
«o =o 

ey 


274 


oz 


tz 
soz 


sor 
soz 
woz 
nz 
siz 
oz 
wz 


‘wopaeyj jo seauop soyommousg 


275 


zy oy GY HS 
WF LY WS mS 
wy oY IS WS 
my WS WS SOS 
ws eS WS wo 
OS WS HS 559 
oss 99 tH 669 
49 99 WoL 6 
Gre OL SRL HR 
we Ste S16 BLO 
cor om yu rz 
Ts Sst og COL 
oz te Le Se 
$06 665 TO TH 
6658S HOLS STIS FHS 
9 Ss 6 f 


wopaayy jo seaiBep sojpenousg 


276 


Soon], FyEMOKG 249 Jo VOSS Aq "9561 "TION SHIM 40/ SRNL BMAOMCHT JO BL >KTRL WOH] PRONPOKIA, 


wopeasy jo sseifep soyoumoueg 


TABLE D. THE x?-DISTRIBUTION 


Xs Xie xis Xia Xie | Xe Xow hos 


00000002 0010 0039-0158] 2706 3.81 
0100 0201 0306. .1026 2107] 4.605 5.991 
0717 114821583818 S844] 6.251 7.815 
2070 «29714847107 1.064 | 7.779 9.448, 
AM? 5543-8312 .AS 1.610 | 9.236 11.07 
6757-8721 1.237 1.635 2.204 | 10.64 12.59 
9893 1.239 1.690 2.167 2.833 | 1202 14.07 
1344 1.647 2.802.732 3490 | 13.36 15.51 
1.735 2088 2700 3.325 4.168 | 14.68 16.92 
2156 2558 3.247 3940 4.865 | 1599 1831 
2603 3.053 38164575 5.578 | 17.27 19.68 
3074 3571 4404 5.226 © 6.304 | 18.55 21.03 
3.565 4.107 5.009 5.892 7.012 | 1981 22.36 
4075 4.660 5.629 6571 7.790 | 2106 23.68 
4601 5.229 6.262 7.261 8547 | 2231 25.00 
5.142 S812 6.908 7.962 9.312 | 2354 26.30 
5.697 6408 7.564 8672 10.09 | 2477 27.59 
6.265 7.015 8.231 9.390 10.86 | 25.99 28.87 
6844 7.633 8907 10.12 11.65 | 27.20 30.14 
7434 8260 9.501 1085 1244 | 2841 3141 37 M 
806M 8897 10.28 1159 13.24 | 2962 3267 3548 3893 41.40 
B63 9.542 10.98 1234 14.04 | 3081 3392 36.78 40.29 42.80 
9.260 1020 1169 1309 4.85 | 3201 35.17 3808 4164 44.18 
9886 1086 1240 1385 15.66 | 33.20 3642 39.36 42.98 45.56 
10521152 13.12 1461 16.47 | 34.38 3765 4065 44.31 46.93 
1379 1495 16.79 1849 20.60 | 40.26 4377 46.98 S089 53.67 
2071 2216 2443 26.51 29.05 | S181 55.76 59.34 63.09 66.77 
2799 2971 3236 M76 37.69 | 63.17 6751 7142 76.15 79.49 
3953 3748 4048 43.19 46.46 | 7440 79.08 83.30 8838 91.95 
43.27 4544 48.76 S174 5533 | 8553 9053 95.02 1004 104.2 
S117 5354 S715 6039 64.28 | 96.58 1019 1066 11231163 
59.20 6175 (65.65 69137329 |1076 «113.1 BA 124.1 1283 
67.33 70.66 74.22 77.93 8286 1185 1243 1296 135.8 140.2 
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Adapted with permission from Biometrika Tables for Statisticians, Vol. 1. 3d ed., Cambridge 
University Press, 1966, edited by E. S. Pearson and H. O. Hartley. 
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TABLE E, CRITICAL LEVELS OF 1 AT 5% AND 1% LEVELS OF SIGNIFICANCE 


a ts in a bs tn 
1 997 1.000 4 388 496 
2 950 990 3 381 487 
3 878 9 6 374 478 
4 sit 917 7 367 470 
5 74 874 B 361 463 
6 707 84 2» 355 456 
7 666 798 0 39 “49 
8 632 765 35 325 418 
9 602 738 0 304 393 
10 576 708 4s 288 372 
uN 553 634 50 23 354 
2 532 61 0 250 325 
13 514 Al 7 232 302 
4 497 623 % 217 283 
15 482 605 ” 205 267 
16 468 590 100 195 254 
7 456 575 125 174 28 
18 AA 561 150 159 208 
19 433 549 200 138 181 
2 423 537 300 m3 48 
a 3 526 400 098 128 
2 404 515 500 088 as 
2B 396 505 1000 062 081 


Reproduced by the courtesy of the author and of the 
Cochran, Statéstical Methods. The Towa State University 


I 


from G. W. Snedecor and W. G. 
‘Ames, lowa, 1967, Table A1J, p. $57. 


i 


2. The median is (11,187 + 9,333/2 = 10,260, the mean is 42,785.75, and 
the trimmed mean (with the upper and lower 25% removed) is 17,252. 
3, From (0.70 ~ 0.11 + 1.34 + 0.05 + 20/5 = 0.086, the return was ~1.55%, 
4, a, 255 Ib 
b. 265 Ib 
. 280.5 Ib 


5. The geometric mean of 31.07 organisms per deciliter does not exceed the 
EPA limit. 

6. Fifty percent of the values are on either side of the mean. Since 3% + 20% 
+ 21% = 44% while 3% + 20% + 21% + 23% = 67%, the median number 
of partners for men must be in the 5-10 category. Similarly, the median 
number of partners for women is in the 2-4 category. (The actual answers 
from the study are a median of 6 for men and 2 for women.) 


Chapter 2 
1, Mean is 147.55, range is 82, and standard deviation is 25.13. 


2. a. 17.59, 18.41, 5.94 
b. 19.09, 18.41, 5.94; 20.23, 21.17, 6.83 


3. $0.17, $0.19, $0.06 


Chapter 3 
1. 866, 92%, 4% 


2. a. (221 ~ 206)/35 = 0.43 
(216 — 206)/35 = 0.29 


3. Fifth out of 20, 75%, (14.23-18.87)/5.33 = —0.87 
4. QOL — 7.2/2.5 = 0.96 
5. (2319 — 1751)/552 = 1.03 
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6, a. 62.5 & 218.0) = 62.5 + 36.0 or (26.5, 98.5) 
b, The index will fall below 26.5 about 59/2 = 2.5% of the time. 
7. Use Chebyshev’s theorem. 


a. 475 = 2(55) = (365, 585) 
b. (475 — 310)/55 = 3 and (640 — 475)/55 = 3; 1 — 1/3? = 88.9%. 


Chapter 4 
1 
15| 
10 
Number 
ot 
5 
rT 
Calories 
2 
29] 
8 
‘Number 
ot 
‘vers 40 
5 
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3. First calculate p. = 138.7 and o = 41.0. 


157 867 97.7 1987 1797 220.7 261.7 Index 
3 2 4 ° 1 2 3 Zscore 


4, First calculate p = 54.83 and o = 6.22 


5.3, fi 
R 
B 
“ 
15 
16 
v 
18 
19 
20 
2 
2 


b. Arranging the set in numerical order gives (110, 117, 120, 122, 122, 125, 
125, 126, 127, 127, 127, 128, 128, 133, 137, 137, 141, 146, 147, 148, 149, 
150, 150, 150, 152, 154, 154, 155, 155, 159, 162, 164, 164, 165, 166, 168, 
168, 169, 174, 176, 176, 177, 189, 190, 193, 193, 195, 207, 219, 228). The 


* 


seomus 
ote 
en 


BeIMerB SSeuaec 
o 
° 
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smallest value is 110, the largest 228. The median is 153, the median of, 
the top half 169, the median of the bottom 128. 


267 


100 


b my 3, 2 3 

w}oirti223 4447788 
w~l}oo124455677779 
3|/15 7888999 

alo 9 


c. p = 266.4 anda 
d. 75%; 40 of 41, or 97.6%, are in (248.4, 284.4) 
€. 689%, 959%, 9996, 70.796, 97.69%, 100% 

7. The median is less than the mean, so the responses are probably skewed 
to the right; there are a few high guesses with most of the responses on 
the lower end of the scale. 


Chapter 5 
1, Using the multiplicative rule, Michener calculated 13 X 17 X 23 X 37 = 
188,071, 
2. Boston Chicken advertisers calculated the permutation (16,3) = 3360, 
when they should have calculated the combination €(16,3) = 560, 


3. Using the multiplicative rule gives 27 = 128. 


Chapter 6 

1. a. 1/4 + 1/7 = CAAA?) & 5/14 
b. V/A 

2. Marilyn assumed independence of events (success for surgery A and 
surgery B), which is most likely wrong! 

3. If p = 1/9,000,000,000, then PCO errors) = (1 — p)?0.0 = 99778 and 
Prat least 1 error) = 1 — .99778 = .00222. 

4. 100.1585) = 138 
1 = [(85)8 + 515X859) = 165 
(89° + 5C1SK85)! + 10(.15)4.85) = 973 

5a 5 
b. C5" = 0039 

6. 1 — (99975) = .0247 


Chapter 7 
1, .001(1,500,000) = 1500 
2. .03(1200) = 36 


3. w= 0C.6) + 1000.3) + 10000(.1) = $1300 
o? = (0 — 1300.6) + (1000 — 1300.3) + (10,000 — 1300.1) = 
8,610,000 and o = $2934 
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5. w = 231/336 = 0.631 
0681 = 532, 0,631e°! = 366, (0.6317/2e! = 106, (0.631/3) 
0651 = 022, (0.6314/4eS! = 004 
‘Then .532 X 366 = 195, 336 X 366 = 123, 106 x 366 = 39, 022 X 366 
, 004 X 366 = 1.5. 
(Actual numbers were surprisingly close: 201 days with no deaths, 118 
days with one death, 33 days with two deaths, 10 days with three deaths, 
3 days with 4 deaths, and 1 day with five deaths.) 


Chapter 10 
1, 5 = 1915 = 3085, 5 — 4772 = .0228 
2, 5 — 2422 = 2578, .5 + 4484 = 9484, 5 ~ 1879 = 3121, 
5 + A982 = 9982 
2881 + 3849 = .6730, 4974 ~ 2881 = 2093, 4772 — 3849 = 0923 
5 — A772 = 0228, 500 ~ 1.645(100) = 336, 500 
30 ~ 2.334) = 20.7, 30 + 1.645(4) = 36.6 
9500 — 1.28(1750) = 7260, 9500 + 1.96(1750) = 6070 and 12,930, 
9500 + 2,33(1750) = 5422 & 13,578 
7. 100 + 0.920 = 120, = 21.7 
B. p+ 1.042) = 16, p = 13.92 
9. (w= 0.676 = 75, w+ 1.04 = 150), w= 104.39, o = 43.86 
10. € + 2.58(0.4) = 8, ¢ = 6.97 
1. w= 29.81) = 23.49, 0 = 21 
5 ~ 4706 = 0294, .1844 — 0047 = .1797, 5 — .0047 = 4953 
12, w= 30.5) = 15, 6 = 2.739, 5 — 4495 = 0505 
13. a. 1225.05.95)" = .261, 19600(.05).95)" = .220 
(253/225 = 256, (2.5/3De25 = 214 
b 2422, 2422 


away 


Chapter 11 
1, 4.4 © 1,645(.35/N99) = 4.4 + 0.082 
2, 28.5 + 1,961,230) = 28.5 + 0.43 


3. a. 64 + 74,59 + 67 
b. 126 + 6.02, 101 + 4.26 
c. 4029 + 100.2, 4827 + 101.4 


4. a3 = 1.52/N64 = 0.19, 0.38/0.19 = 2, 4772 + 4772 = 9544 


5. 9540 * 2,58(1205/V30) = 9540 = 567.6 
335(9540 + 567.6) = $3,196,000 = $190,000 
6. 1.960.1/Vn) = 0.2, Vn = 10.78, n = 116.2; choose n = 117. 


7. 
8 


9. 


10. 
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450) = 200 

22, 5 = 248, 0; = 248/V35 = 4.19 

22 * 2.33(4.19) = $22,000,000 = $9,770,000 

R= 2753.8/49 = 56.2, & = 1348.32/48 = 28.09, s = 5.3, 

3 = 5.3/NI9 = .757, 56.2 + 1.645(.757) = 56.2 + 1.25 

378/36 = 10.5, # = (4123.35 — 3787/36/35 = 4.41, s= 2.1, 
oy = 2.1/N36 = 0.35, 10.5 * 1.96(0.35) = 10.5 * 0.686 


Chapter 12 


i: 


93 = 0.93 = 0.15, 3.25 ~ 3.5/0.15 = ~167, 
5 — 4525 = 0475 


2, a= 4.50/N40 = 0.712, ¢ = 25 + 1.645(0.712) = 26.17 
3. € = 12 + 2,05(6/NO4) = 13.54, Since 13.1 < 13.54, there is not sufficient 


10. 


evidence to reject the city spokesperson’s claim. 


|. a. og = 3.5/ND = 0.5, (16.25 — 15/05 = 2.5, p= 5 — 4938 = .0062. 


b. Since .0062 < .01, there és sufficient evidence to doubt the 15-minute 
claim at all three levels. 

a. (1,72 ~ 1.6/0.ANGA) = 24, 5 — 4918 = .0082, and because this is 
a twoxtaled test, p = 26.0082) = 0164. 

b. Thus the claim should be disputed at the 10% and 5% significance levels 
but mot at the 1% level. 

a, F = 109.08/36 = 3.03, ¢ = 0504/35 = 0.0144, s = 0.12, of = 
0.1236 = 0.02, (3.03 ~ 30.02 = 1.5, p = .5 ~ 4332 = .0668. 

b. Since .0668 is greater than .01, .02, and .05, the manager's claim should 
not be rejected at the 1%, 2%, or 5% significance level. 

a, ¥ = 2370/30 = 79, # = (197,287 ~ 23707/30)/29 = 346.79, s = 18.62, 
ay = 18.62/V30 = 3.40, (79 — 85)/3.4 = —1.76, p = 5 — 4608 = .0392, 

bb, Since .0392 < .05, there 4s sufficient evidence to reject the agent’s claim 
at both the 10% and 5% levels of significance. 

=05, B= .5 + 1915 = 6915.25 
3085, 


For 22, (23.14 — 22)/0.8 = 1.43, B 
For 20, (23.14 — 20)/0.8 = 3.925, B 


oz = 0.936 = 0.15. 


For a = .10, ¢ = 1.5 + 1.28(0.15) = 1.692, (1.692 ~ 1.8/0.15 = -0.72, 
B= 5 — 2642 = 2358. 
For a = .05, ¢ = 1.5 + 1.645(0.15) = 1.747, (1.747 ~ 1.8/0.15 = ~0.35, 
B= 5 — 1368 = 3632, 


€ = 0 + 1,28,0294) = .0376, 2.6 — 2.56 = .04 
‘There és sufficient evidence at the 10% significance level to say that liberal 
arts colleges give higher grades. 

8, Standard deviations from the two samples are needed. 


Chapter 14 

1, df= 15, 74 2.94701.3NT6) = 74 * 96 

2, ¥= 35.7/17 = 2.1, # = 0,096/16 = 0.006, s = 0.0775, df= 16, 21 + 
1,337(0.0775/V17) = 2.1 + 0.025 

3. B= 4,048, s = 2.765, df= 20, 4.048 + 1.725(2.765/V21) = 4.048 + 1.041 

4. a. 7.5 * 2.262(4.085/N9) = 7,5 + 3.08 
b, 19.11 * 2.26266.791N9) = 19.11 + 5.12 
c. 11.61 * 2.26204.891/N9) = 11.61 + 3.69 

5. € = 1250 — 1.796(616/V12) = 930.6 
Since 934 > 930.6, there is not sufficient evidence at the 5% significance 
level to dispute the IRS claim. 

6. Critical values are 2000 + 2.093(600/V20) = 1719.2 and 2280.8. Since 2250 
is inside this range, there is not sufficient evidence at the 5% significance 
level to dispute the auditor's remarks. 

7. R= 04/18 = 28, ¢ = (1638 — 50.47/18)/17 = 1.334, s= 1.155, ¢ = 2 
+ 2.567(1.155/V18) = 2.70. 

Since 28 > 2.70, there is sufficient evidence at the 1% significance level 
to dispute the 2-hour claim. 

8. F= 5.67, s = 0.763, c= 5 + 2.015(0.763/V6) = 5.63. 

Since 5.67 > 5.63, there és sufficient evidence at the 5% significance level 
to dispute the company scientist's claim. 

9. B= 24.581, s= 0.7793, c= 25 — 2.485(0.7793/V26) = 24.620. Since 24.581 
< 24,620, there 4s sufficient evidence at the 1% significance level to dispute 
the null hypothesis of 25. 

10. X= 301.3, s = 2.21, critical values are 300 + 3.143(2.21/V7) = 297.37 and 
302.63. Since 301.3 is inside this range, there is not sufficient evidence at 
the 2% significance level for the FDA investigator to reject the label. 
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u 
_ (Enon s = paF fr, 7 _ 
- O46 =z 6 6 oe 


(75 — 71) = 3.160.899) = 4 + 2.85 


ae (@—ne3r +G—vesr fT 7 
Ss ae 


1421 


(81 ~ 98) + 2201(1.421) = ~1.7 + 3.13 
13, 5 n= 7 
6 = 78 
5, = 354 10.23 
(5 = asa + 


1023) ie 
ety = 4822 


(61 — 78) * 1.812(4,822) = -17 + 8.74 


3-5 = St7— 


“ 

_ [co nas + c0—peDe f 738 
7 10+10-2 1G; 210) ee 
= 0 ~ 17340.114) = ~ 0.198, 
Since 4 — 48 = -08 < -0.198, there 4s sufficient evidence at the 5% 
significance level that the new diet improves weight gain in laboratory 
mice 


15, m=8 m=6 


%=7 8%, =67 
5 = 832 5, = 998 


(8 = 1832) + (6 = 119.98" 1. 
B+6-2 were" 


Critical values are 0 + 1.356(4.89) = + 6.63. 
Since 70 ~ 67 = 3 is inside the range, the observed difference in typing 
speeds is not significant with « = .20. 


Chapter 15, 
1, 3 = 375, 5 = 05, % 


25, % = 0.957, X = 2.75, s = 1.258 


b= GIS + 1.25 + 2.75/73 = 2583 
or? = (3.75 — 2583+(1.25 — 2583)°+(2.75 -2 583)°/(3 -1) = 1.583 
ORithin = (0.5? + 0.957? + 1.2587/3 = 0.916 


Ferwcen = 41.583) = 6.332 

F = 6332/0916 = 691 

With dfruiersior = 3 — 1 = 2, ficoominatce = X4 — 1) = 9, and a = .05, 
the critical Fvalue is Faas! = 4.26. Since 6.91 > 4.26, 
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| 

| there és sufficient evidence that the three honeys do not have the same 

| effectiveness. 

| 2, 23.0 + 129 + 86/3 = 14.83 

[23.0 — 14.837+(129 ~ 14.83)°+(8.6 — 14.8373 ~ 1) = 54.64 

11.2 + 88? + 6.49)/3 = 81.28 
Oherween = 2554.64) = 1366 
F = 1366/81.28 = 164 
With Ufnumernoe = 3 ~ 1 = 2. dicrominnce = 425 — 1) = 72, and a = 01, 
the critical Evalue is Figs) = 4.94. Since 16.81 > 4.94, there is sufficient 
evidence that the three treatments produce different results. 

3. 3% = 3.73, = 427, = 3.97, 77, % = 447 

04, 5 = 0.594, 55 = 0.769, 5, = 0.704, 55 = 0.896 


(3,73-a.042) + (4.274.042 + (3.97-4.082) + 

(3.77-4.042 + (4.47=4.082(5-1) = 0.103 

GRinin = (0.904? + 0.594? + 0.7692 + 0.704? + 0.8967)/5 = 0.612 
Reeween = 150.103) = 1.545 

F= 1545/0612 = 2:52 

With Arumersue = 5— 1 = 4, Mscovaninnce = X15 — 1) = 70, and a = 01, 
the critical Evalue is Fics = 3:62. Since 2,52 < 3.62, there is not sufficient 
evidence at the 1% significance level to conclude that some franchises 
prepare orders quicker than others, 


Chapter 16 


ay 
= BxED ia 


39 + 1.645(.0218) = 39 = .036 


2 
0, = yo SS = 0137 


75 * 196.0137) = .75 + 027 


3. a. B= 26/130 = 2 
- 2@ 
oR = a3 79352 


2 + 2,58(.0351) = 2 = 091 

b. 13,000(.2 + .091) = 2,600 + 1,180 
4. a. 5 + 233.0236) = 5 = 055 

b. .32 + 1.88(.0342) = 32 + .064 

c. 31 + 128.0661) = 31 + .085 


A-B-2 = 7=Z= 08 


put 
P- m+ 


1,2 
asaxsie( 5 4) 0546 
€ = 0 + 1.645(.0546) = .107. 
Since 219 — 031 = .188 > .107, there és sufficient evidence at the 5% 
significance level to say that a higher proportion of blacks than whites 
receive the death penalty. 


= 182 


+a aE Ee 


Critical values are 0 + 1.96(.0477) = + .0935. 
Since 906 — .881 = .025 is within this range, there is not sufficient 
evidence at the 5% significance level to say that the probability Bird will 
make a second free throw depends on whether he makes the first 
08240) = 20 .255(240) = 


p= 20+6 
P20 +240 


= 10 


1). 
= faoxaai{ 3; +35) 0342 
¢ = 0 — 2.3%.0342) = —.0797. 
Since 083 ~ 255 = ~.172 < —.0797, there és sufficient evidence at the 
196 significance level to say that AZT helps prevent the HIV virus from 
being passed from mother to child. 


9%875) = 805 93(875) = B14 


5 = 805+ 814 
875 + 875 


m = |sasxors{ 3 +22) = 0126 


(92 — 93)/.0126 = —0.79 and the p-value of this two-tailed test is 2.5 — 
.2852) = 4296. 

With such a high p-value, the difference in survival rates is not statistically 
significant. 


= 925 
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8. a, 
= O14 
(552 — 364) + 2.05(.0114) = 188 + 0234 
b = . 551 = ; 
A-Bl-o8 7-3 - a0 
1.915.085) | (447.553) 
oa tS = cons 
(915 — 447) + 233.0204) = 468 + 0475 
c 383 _ 5-43. 
arg = 8B = aay = 19 
z 383 + 473, m 
4710 + 2979 
1 = 
= yunnnssof + =e) 0074 


¢ = 0 — 2.58(,0074) = —.0190, 
Since .081 ~ 159 = ~.078 < ~.0190, there is sufficient evidence that a 
Bester proportion of women tan men ear unde $10,000 


3776 _ 5 = 2455. 
A 4710 sd Pi, 2979 ot 
3776 + 2455 
Pp 4710 + 2979 pie 


1). 
(810X. wos + as) 0092 
Critical values are 0 + 2.24(.0092) = + 021. 
Since .802 — .824 = —.022 is outside this range, the difference between 
the proportions of men and women earning between $10,000 and $50,000 
4s significant at the 2.5% significance level 
9. 1.645¢5V2/Vn) = .03, Vn = 38.77, n = 1503.3, so the researcher should 
choose a sample size of at least 1504 


Chapter 19 
1, .20(625) = 125, .25(625) = 156.25, .35(625) = 218.75 
2 = G1O= 125) , (140 ~ 15625" , (250 = 218752 
125 15625 218.75, 


25 — 125" 
Bs 


= 7954 
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With df= 4 — 1 = 3 and a = .05, the critical x? value is 7.815. Since 7.954 
> 7.815, there és sufficient evidence at the 5% significance level to say that 
the percentages have changed. 

2, We have that 2+ 3+3+4+6= 18and 


2 = 2 - 
7 (6000) = 667 5 (6000) = 1000 
4 6 
75 (6000) = 13335, (6000) = 2000 
2 = (720-667 | (970 ~ 1000)? 1013 ~ 10002 
x 7 7000 7000 
(4380 = 1333, (1917 ~ 200092 
BSS 2000 Ce 
With df= 5—1 = 4 and a = 05 the critical x? value is 9.448; with a = 


025 the critical x? value is 11.14, Thus there és sufficient evidence to reject 
the superintendent's claim at the 5% significance level but not at the 2.5% 
level. 


(1372 + 1578 + 1686)/3 = 1545 


2 = (372 = 151592 , 1578 = 1545), 1696 — 1545)* 
1545 1345 1545 


= 3294 


With df= 3 ~ 1 = 2and a = .05, the critical x? value is 5.991. Since 32,94 
> 5.991, there is sufficient evidence at the 5% significance level to say that 
the number of accidents on each shift are not the same. 

4, PCS2) = (2) + 5(8X.2)* + 108712) = 05792, 
PB) = 10. 8)%.2)? = 2048, PA) = 5C8)%.2) = 4096, 
POS) = (.8)° = 32768, .05792(200) = 11.6, .2048(200) = 41.0, .4096(200) 
= 81.9, .32768(200) = 65.5. 


2 G=M6Y , G4 ~ 410% , (0-819 , (71 - 65592 
116 410 B19 655 


= 6213 


‘With df= 4— 3and a = .05, the critical x? value is 7.815. Since 6.213 
< 7.815 we conclude that the observed data do not differ significantly from 
what would be expected for a binomial distribution with = = 8. 

5, 245(0) + 1701) + 50(2) + 35(3) = 375, p = 37513(500)1 = 25. 
€75)X500) = 211, 4.25X.75)(500) = 211, 
34.25)4.75(500) = 70, (.25)%500) = 8. 


= 245-21? | 070-217, Go-70F , GS-8» _ 
MOS Seaiipgs Monae, Ce aTOn gs xB a 


103 


With df= 4 — 2 = 2 and a = 01, the critical x? value is 9.21. Since 110.3 
> 9.21 there is sufficient evidence that the distribution is not binomial. 
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PCO) = e24 =091 PUI) = 24e“24 = 218 


3 
pay = SP os = 261 73) = SP ors = 209 
G4 
Peay = SP" oes = 125 
PG oF more) = 1 — (091+ 218+ 261+.209+.125) = .096 


.091(83) = 7.6, 21883) = 18.1, 261(83) = 21.7, 
2083) = 17.3, .125(83) = 10.4, .096(83) = 8.0. 


G—76F , OS ~181P | 8 ~217F , (25-173 

2 

x 57 TERROR Gpees eat “© en haa 171 
(6-104 = 80y _ 

+ ams aad * baer Y. ei $100 


With df= 6 — 1 = 5 and a = .05, the critical x? value is 11.07, 
Since 51.00 > 11.07, there is sufficient evidence that the distribution is not 
Poisson with & = 24. 


5(0)+16(1)+17(2)+8(3)+4(4) = 90, 90/50 = 1.8, 


1850) = 83 Be *(50) = 149 


GB» -14(50) = 290 ABM 418050) = 36 
z co 
2 = G=A3F , U6- 149% | 7-134) | (&~ ROP 
x a m9 4 80 
= 36% _ 
+S « 205 


With df= 5 ~ 2 = 3 and a = .10, the critical x? value is 9.236. 
Since 2.405 < 9,236, the observed data do not differ significantly from 
what would be expected for a Poisson distribution 


.0668(100) = 6.68, .2417(100) = 24.17, 3830(100) = 38.30 
2 = G0 = 668) , 2~ 2417", G3 ~ 3837, 5 ~ 241792 
668 Bay 383 2417 


G0 — 668% 
ee 


= 10049 


With df= 5 — 1 = 4 and a = .025, the critical x2-value is 11.14, Since 
10.049 < 11.14, the observed data do not differ significantly from what 
would be expected for a normal distribution with 4 = 55 and ¢ = 10. 


Chapter 20 
aS Vertex No vertex 
baldness baldness Expected 
Heart 24 451 665 180, 485, 
acs 
Other 175 7 7 209 563 
admissions 
= 1m 


(665 389)/1437 = 180, (665 1048)/1437 = 485, 
(772X389)/1437 = 209, (72 1048)/1437 = 563 


2 = 214-180" + (451 — 485 + 75 — 209 7 (597 — 563)" 
is east 5 20 363 
= 1639 


With df = (2 ~ 1X2 ~ 1) = Land a = .05, the critical x? value is 3.841. 
Since 16.39 > 3.841, there és sufficient evidence at the 5% significance level 
to say that a relationship exists between heart attacks and vertex baldness. 


Aspirin Placebo 


Fatal 
amack 
Nonfatal 
attack 
No heart 
attack 


11037 11,034 


- GH Use , 8115) | (99 = 135) | 471 ~ 1359 
115 115 135 135 

(10933 — 108905 
108905, 


+ 


With df= (3 — 1X2 - 1) = 2 and @ = 05, the critical x?-value is 5.991. 
Since 26.88 > 5.991, there és sufficient evidence at the 5% significance level 
to say that a relationship exists between taking aspirins and the risk of 
heart attacks. 
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With df= (2 ~ 1X3 ~ 1) = 2and a = .10, the critical x2-value is 4,60. 
Since 3.018 < 4.605 there is not sufficient evidence at the 10% significance 
level of a relationship between cholesterol level and risk of heart disease 
for men over 70 years old 
4a (57 -718F , G4 - 364% , (24-177 , 7 - 61 
x ja. Mn ae 
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4 (548 = 5332), (273 = 2706 | (125 ~ 1313) 
5532 706 1313 
(A= 49 ayy 


arr 


With df= (2 — 1X4 — 1) = 3.and a = 01, the critical y*-value is 13.28. 
Since 28.31 > 13.28, there és sufficient evidence at the 1% significance level 
of a relationship between amount of fecal material in the water and 
reported gastroenteritis illness 

5, Since 4.18 < 9.448 there is not sufficient evidence at the 5% significance 
level of a relationship between hair loss pattern and body mass index. 

6, Since 87.6 > 19.02 there és sufficient evidence at the 2.5% significance level 
of a relationship between fitness level and smoking habits 


Chapter 21 
1, Slope is 20/10 = 2, that is, a woman's risk of ovarian cancer rises 2% for 
every gram of fat consumed per day. 
2. a. y= 0.0227x + 707 where <x represents dollars spent per student and 
represents the average SAT total scores. 
b, Each additional $1000 spent per student corresponds to a rise of 22,7 
in the average SAT total scores. 
¢. 0,0227(10,000) + 707 = 934 
, 0.0227x + 707 = 1000 gives x = $12,907. 


Answers to Exercises 301 


5. b. y= 485.34 e063", r= 993 
c. When x = 115, y = $59,583. 
25,000 = 485.34 90%" gives x = 60, or 1995 as the year when 
personal income will reach $25,000. 
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